Preface


This  book  is  a  compilation  of  much  of  the  material  I  used  for  various  game  theory 
courses over the past decades. The first part, Thinking Strategically, is intended
for  undergraduate  students  in  economics  or  business,  but  can  also  serve  as  an 
introduction  for  the  subsequent  parts  of  the  book.  The  second  and  third  parts  go 
deeper  into  the  various  topics  treated  in  the  first  part.  These  parts  are  intended 
for  more  mathematically  oriented  undergraduate  students,  or  for  graduate  students 
in  (for  instance)  economics.  Part  II  is  on  noncooperative  games  and  Part  III  on 
cooperative  games.  Part  IV  consists  of  a  mathematical  tools  chapter,  a  chapter  with 
review  problems  for  Part  I,  and  a  chapter  with  hints  and  solutions  to  the  problems 
of  all  chapters.  Every  chapter  has  a  section  with  problems. 

The  material  draws  heavily  on  game  theory  texts  developed  by  many  others,  often 
in  collaboration.  I  mention  in  particular  Jean  Derks,  Thijs  Jansen,  Andres  Perea,  Ton 
Storcken,  Frank  Thuijsman,  Stef  Tijs,  Dries  Vermeulen,  and  Koos  Vrieze.  I  am  also 
seriously  indebted  to  a  large  number  of  introductory,  intermediate,  and  advanced 
texts  and  textbooks  on  game  theory,  and  hope  I  have  succeeded  in  giving  sufficient 
credits  to  the  authors  of  these  works  in  all  due  places. 


About  the  Second  Edition 

In  this  second  edition,  I  have  corrected  mistakes,  omissions,  and  typos  from  the 
first  edition,  and  tried  to  improve  the  exposition  throughout  the  book.  I  have  added 
extra  problems  to  some  chapters,  and  also  a  chapter  with  review  problems  for  Part  I. 
In  Chap.  6,  I  have  added  a  few  sections  on  auctions  with  incomplete  information. 
With  only  few  exceptions,  the  references  to  the  literature  are  now  collected  in  Notes 
sections,  which  conclude  every  chapter  in  the  book. 

This  second  edition  has  benefitted  tremendously  from  extensive  comments  of 
Piotr Frackiewicz and Peter Wakker. The list of people from whom I received
comments  also  includes  Krzysztof  Apt,  Maikel  Bosschaert,  Yukihiko  Funaki,  Ali 
Ihsan  Ozkes,  Mahmut  Parlar,  Thijs  Ruijgrok,  Steffen  Sagave,  Judith  Timmer,  Mark 
Voorneveld,  and  others. 

How  to  Use  This  Book 

Part  I  of  the  book  is  intended,  firstly,  for  undergraduate  students  in  economics  and 
business  and,  secondly,  as  preparation  and  background  for  Parts  II-IV.  Part  I  is 
preceded  by  Chap.  1,  which  is  a  general  introduction  to  game  theory  by  means  of 
examples.  The  first  chapter  of  Part  I,  Chap.  2  of  the  book,  is  on  zero-sum  games. 
This  chapter  is  included,  not  only  for  historical  reasons — the  minimax  theorem  of 
von  Neumann  (1928)  was  one  of  the  first  formal  results  in  game  theory — but  also 
since  zero-sum  games  (all  parlor  games)  require  basic,  strictly  competitive,  game- 
theoretic  thinking.  The  heart  of  Part  I  consists  of  Chaps.  3-6  on  noncooperative 
games  and  applications,  and  Chap.  9  as  a  basic  introduction  to  cooperative  games. 
These chapters can serve as a basic course in game theory. Chapters 7 and 8 on
repeated  games  and  evolutionary  games  can  serve  as  extra  material,  as  well  as 
Chap.  10  on  cooperative  game  models  and  Chap.  11,  which  is  an  introduction  to 
the  related  area  of  social  choice  theory. 

Although  this  book  can  be  used  for  self-study,  it  is  not  intended  to  replace  the 
teacher.  Part  I  is  meant  for  students  who  are  knowledgeable  in  basic  calculus,  and 
does  not  try  to  avoid  the  use  of  mathematics  on  that  basic  level.  Moreover,  (almost) 
all  basic  game  theory  models  are  described  in  a  formally  precise  manner,  although 
I  am  aware  that  some  students  may  have  a  blind  spot  for  mathematical  notation  that 
goes  beyond  simple  formulas  for  functions  and  equations.  This  formal  presentation 
is  included  especially  since  many  students  have  always  been  asking  questions  about 
it:  leaving  it  out  may  lead  to  confusion  and  ambiguities.  On  the  other  hand,  a  teacher 
may  decide  to  drop  these  more  formal  parts  and  go  directly  to  the  examples  of 
concretely  specified  games.  For  example,  in  Chap.  5,  the  game  theory  teacher  may 
decide  to  skip  the  formal  Sect.  5.1  and  go  directly  to  the  worked  out  examples  of 
games  with  incomplete  information — and  perhaps  later  return  to  Sect.  5.1. 

Parts  II-IV  require  more  mathematical  sophistication  and  are  intended  for 
graduate  students  in  economics,  or  for  an  elective  game  theory  course  for  students 
in  (applied)  mathematics.  In  my  experience,  it  works  well  to  couple  the  material 
in  these  parts  to  related  chapters  in  Part  I.  In  particular,  one  can  combine  Chaps.  2 
and  12  on  zero-sum  games,  Chaps.  3  and  13  on  finite  games,  Chaps.  4,  5,  and  14 
on  games  with  incomplete  information  and  games  in  extensive  form,  and  Chaps.  8 
and  15  on  evolutionary  games.  For  cooperative  game  theory,  one  can  combine 
Chap.  9  with  Part  III. 

Each  chapter  contains  a  problems  section.  Moreover,  Chap.  23  contains  review 
problems for Part I. Hints, answers and solutions are provided at the end of the book
in  Chap.  24.  For  a  complete  set  of  solutions  for  teachers,  as  well  as  any  comments, 
please  contact  me  by  email. 

Maastricht,  The  Netherlands  Hans  Peters 

January  2015 
1 Introduction


The  best  introduction  to  game  theory  is  by  way  of  examples.  This  chapter  starts 
with  a  global  definition  in  Sect.  1.1,  collects  some  historical  facts  in  Sect.  1.2,  and 
presents  examples  in  Sect.  1.3.  Section  1.4  briefly  comments  on  the  distinction 
between  cooperative  and  noncooperative  game  theory. 


1.1  A  Definition 

Game  theory  studies  situations  of  competition  and  cooperation  between  several 
involved  parties  by  using  mathematical  methods.  This  is  a  broad  definition  but  it 
is  consistent  with  the  large  number  of  applications.  These  applications  range  from 
strategic questions in warfare to understanding economic competition, from economic
or social problems of fair distribution to behavior of animals in competitive
situations,  from  parlor  games  to  political  voting  systems — and  this  list  is  certainly 
not  exhaustive. 

Game  theory  is  an  official  mathematical  discipline  (American  Mathematical 
Society Classification code 91A) but it is developed and applied mostly by
economists.  In  economics,  articles  and  books  on  game  theory  and  applications  are 
found  in  particular  under  the  Journal  of  Economic  Literature  codes  C7x.  The  list  of 
references  at  the  end  of  this  book  contains  many  textbooks  and  other  books  on  game 
theory. 


1.2  Some  History 

In  terms  of  applications,  game  theory  is  a  broad  discipline,  and  it  is  therefore  not 
surprising  that  game-theoretic  situations  can  be  recognized  in  the  Bible  (Brams, 
1980)  or  the  Talmud  (Aumann  and  Maschler,  1985).  Also  the  literature  on  strategic 
warfare contains many situations that could have been modelled using game theory:
a  very  early  reference,  over  2,000  years  old,  is  the  work  of  the  Chinese  warrior- 
philosopher  Sun  Tzu  (1988).  Early  works  dealing  with  economic  problems  are 
Cournot  (1838)  on  quantity  competition  and  Bertrand  (1883)  on  price  competition. 
Some of the work of Dodgson (better known as Lewis Carroll, the writer of
Alice’s Adventures in Wonderland) is an early application of zero-sum games to the
political  problem  of  parliamentary  representation,  see  Dodgson  (1884)  and  Black 
(1969). 

One  of  the  first  formal  works  on  game  theory  is  Zermelo  (1913).  The  logician 
Zermelo  proved  that  in  the  game  of  chess  either  White  has  a  winning  strategy  (i.e., 
can  always  win),  or  Black  has  a  winning  strategy,  or  each  player  can  always  enforce 
a  draw — see  Sect.  13.2.5.  Up  to  the  present,  however,  it  is  still  not  known  which 
of  these  three  cases  is  the  true  one.  A  milestone  in  the  history  of  game  theory  is 
Von  Neumann  (1928).  In  this  article  von  Neumann  proved  the  famous  minimax 
theorem  for  zero-sum  games.  This  work  was  the  basis  for  the  book  Theory  of 
Games  and  Economic  Behavior  by  von  Neumann  and  Morgenstern  (1944/1947), 
by  many  regarded  as  the  starting  point  of  game  theory.  In  this  book  the  authors 
extended  von  Neumann’s  work  on  zero-sum  games  and  laid  the  groundwork  for 
the  study  of  cooperative  (coalitional)  games.  See  Dimand  and  Dimand  (1996)  for  a 
comprehensive  study  of  the  history  of  game  theory  up  to  1945. 

The  title  of  the  book  of  von  Neumann  and  Morgenstern  reveals  the  intention  of 
the  authors  that  game  theory  was  to  be  applied  to  economics.  Nevertheless,  in  the 
1950s  and  1960s  the  further  development  of  game  theory  was  mainly  the  domain 
of  mathematicians.  Seminal  articles  in  this  period  were  the  papers  by  Nash  on 
Nash  equilibrium  (Nash,  1951)  and  on  bargaining  (Nash,  1950),  and  Shapley  on  the 
Shapley  value  and  the  core  for  games  with  transferable  utility  (Shapley,  1953, 1967). 
See  also  Bondareva  (1962)  on  the  core.  Apart  from  these  articles,  the  foundations 
of  much  that  was  to  follow  later  were  laid  in  the  contributed  volumes  edited  by 
Kuhn  and  Tucker  (1950,  1953),  Dresher  et  al.  (1957),  Luce  and  Tucker  (1958),  and 
Dresher  et  al.  (1964).  A  classical  work  in  game  theory  is  Luce  and  Raiffa  (1957): 
many  examples  still  used  in  game  theory  can  be  traced  back  to  this  source,  like  the 
Prisoners’  Dilemma  and  the  Battle  of  the  Sexes. 

In  the  late  1960s  and  1970s  of  the  previous  century  game  theory  became  accepted 
as a new formal language for economics in particular. This development was stimulated
by the work of Harsanyi (1967/1968) on modelling games with incomplete
information  and  Selten  (1965,  1975)  on  (sub)game  perfect  Nash  equilibrium. 

In  1994,  Nash,  Harsanyi  and  Selten  jointly  received  the  Nobel  prize  in  economics 
for  their  work  in  game  theory.  Since  then,  many  Nobel  prizes  in  economics 
have  been  awarded  for  achievements  in  game  theory  or  closely  related  to  game 
theory:  Mirrlees  and  Vickrey  (in  1996),  Sen  (in  1998),  Akerlof,  Spence  and 
Stiglitz  (in  2001),  Aumann  and  Schelling  (in  2005),  Hurwicz,  Maskin  and  Myerson 
(in  2007),  and  Roth  and  Shapley  (in  2012). 


1  See  Nasar  (1998)  for  a  biography,  and  the  later  movie  with  the  same  title  A  Beautiful  Mind. 
From the 1980s on, large parts of economics have been rewritten and further
developed  using  the  ideas,  concepts  and  formal  language  of  game  theory.  Articles 
on  game  theory  and  applications  can  be  found  in  many  economic  journals.  Journals 
explicitly focusing on game theory include the International Journal of Game
Theory, Games and Economic Behavior, and International Game Theory Review.
Game theorists are organized within the Game Theory Society, see
http://www.gametheorysociety.org/.


1.3  Examples 

Every  example  in  this  section  is  based  on  a  story.  Each  time  this  story  is  presented 
first and, next, it is translated into a formal mathematical model. Such a mathematical
model is an alternative description, capturing the essential ingredients of the
story  with  the  omission  of  details  that  are  considered  unimportant:  the  mathematical 
model  is  an  abstraction  of  the  story.  After  having  established  the  model,  we  spend 
some  lines  on  how  to  solve  it:  we  say  something  about  how  the  players  should  or 
would  act.  In  more  philosophical  terms,  these  solutions  can  be  normative  or  positive 
in  nature,  or  somewhere  in  between,  but  often  such  considerations  are  left  as  food  for 
thought  for  the  reader.  As  a  general  remark,  a  basic  distinction  between  optimization 
theory  and  game  theory  is  that  in  optimization  it  is  usually  clear  when  some  action  or 
choice  is  optimal,  whereas  in  game  theory  we  deal  with  human  (or,  more  generally, 
animal)  behavior  and  then  it  may  be  less  clear  when  an  action  is  optimal  or  even 
what  optimality  means. 

Each example is concluded by further comments, possibly including a short
preview  on  the  treatment  of  the  exemplified  game  in  the  book.  The  examples  are 
grouped  in  subsections  on  zero-sum  games,  nonzero-sum  games,  extensive  form 
games,  cooperative  games,  and  bargaining  games. 


1.3.1 Zero-Sum Games

The first example is based on a military situation staged in World War II.

1.3.1.1 The Battle of the Bismarck Sea

Story  The  game  is  set  in  the  South-Pacific  in  1943.  The  Japanese  admiral  Imamura 
has  to  transport  troops  across  the  Bismarck  Sea  to  New  Guinea,  and  the  American 
admiral  Kenney  wants  to  bomb  the  transport.  Imamura  has  two  possible  choices:  a 
short  Northern  route  (2  days)  or  a  long  Southern  route  (3  days),  and  Kenney  must 
choose one of these routes to send his planes to. If he chooses the wrong route he
can call back the planes and send them to the other route, but the number of bombing
days is reduced by 1. We assume that the number of bombing days represents the
payoff to Kenney in a positive sense and to Imamura in a negative sense.

2 Feyerabend’s (1974) ‘anything goes’ adage reflects a workable attitude in a young science like
game theory.

Model  The  Battle  of  the  Bismarck  Sea  problem  can  be  modelled  using  the 
following  table: 

\[
\begin{array}{c|cc}
 & \text{North} & \text{South} \\ \hline
\text{North} & 2 & 2 \\
\text{South} & 1 & 3
\end{array}
\]

This  table  represents  a  game  with  two  players,  namely  Kenney  and  Imamura. 
Each  player  has  two  possible  choices;  Kenney  (player  1)  chooses  a  row,  Imamura 
(player  2)  chooses  a  column,  and  these  choices  are  to  be  made  independently  and 
simultaneously.  The  numbers  represent  the  payoffs  to  Kenney.  For  instance,  the 
number  2  up  left  means  that  if  Kenney  and  Imamura  both  choose  North,  the  payoff 
to Kenney is 2 and the payoff to Imamura is −2. Thus, the convention is to let the
numbers  denote  the  payments  from  player  2  (the  column  player)  to  player  1  (the 
row  player).  This  game  is  an  example  of  a  zero-sum  game  because  the  sum  of  the 
payoffs  is  always  equal  to  zero. 

Solution  In  this  particular  example,  it  does  not  seem  difficult  to  predict  what  will 
happen.  By  choosing  North,  Imamura  is  always  at  least  as  well  off  as  by  choosing 
South,  as  is  easily  inferred  from  the  above  table  of  payoffs.  So  it  is  safe  to  assume 
that  Imamura  chooses  North,  and  Kenney,  being  able  to  perform  this  same  kind  of 
reasoning,  will  then  also  choose  North,  since  that  is  the  best  reply  to  the  choice 
of  North  by  Imamura.  Observe  that  this  game  is  easy  to  analyze  because  one  of 
the  players  (Imamura)  has  a  dominated  choice,  namely  South:  no  matter  what  the 
opponent  (Kenney)  decides  to  do,  North  is  at  least  as  good  as  South,  and  sometimes 
better. 

Another way to look at this game is to observe that the payoff 2 resulting from
the combination (North, North) is maximal in its column (2 ≥ 1) and minimal in
its row (2 ≤ 2). Such a position in the matrix is called a saddlepoint. In such a
saddlepoint, neither player has an incentive to deviate unilaterally. (As will become
clear later, this implies that the combination (North, North) is a Nash equilibrium.)
Also observe that, in such a saddlepoint, the row player maximizes his minimal
payoff (because 2 = min{2, 2} ≥ 1 = min{1, 3}), and the column player (who has
to pay according to our convention) minimizes the maximal amount that he has to
pay (because 2 = max{2, 1} ≤ 3 = max{2, 3}). The resulting payoff of 2 from
player 2 to player 1 is called the value of the game.
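The saddlepoint check can also be carried out mechanically. The following Python fragment is only an illustrative sketch and not part of the formal treatment; the function names saddlepoints, maximin and minimax are our own choices.

```python
# Payoff matrix of the Battle of the Bismarck Sea (payoffs to Kenney, the row player).
# Rows and columns are ordered North, South.
A = [[2, 2],
     [1, 3]]

def saddlepoints(matrix):
    """Return all (row, column) positions that are minimal in their row
    and maximal in their column."""
    points = []
    for i, row in enumerate(matrix):
        for j, entry in enumerate(row):
            column = [r[j] for r in matrix]
            if entry == min(row) and entry == max(column):
                points.append((i, j))
    return points

def maximin(matrix):
    """Largest payoff the row player can guarantee with a single choice."""
    return max(min(row) for row in matrix)

def minimax(matrix):
    """Smallest upper bound on his payment that the column player can guarantee."""
    return min(max(column) for column in zip(*matrix))

print(saddlepoints(A))          # [(0, 0)], i.e. (North, North)
print(maximin(A), minimax(A))   # 2 2
```

In this example the two guaranteed amounts coincide, and their common value 2 is the entry at the saddlepoint.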

Comments  Two-person  zero-sum  games  with  finitely  many  choices,  like  the  one 
above,  are  also  called  matrix  games  since  they  can  be  represented  by  a  single  matrix. 
Matrix games are studied in Chaps. 2 and 12. The combination (North, North) in the
example  above  corresponds  to  what  happened  in  reality  back  in  1943. 

1.3.1.2 Matching Pennies

Story In the two-player game of matching pennies, both players have a coin and
simultaneously  show  heads  or  tails.  If  the  coins  match,  player  2  gives  his  coin  to 
player  1;  otherwise,  player  1  gives  his  coin  to  player  2. 


Model  This  is  a  zero-sum  game  with  payoff  matrix 


\[
\begin{array}{c|cc}
 & \text{Heads} & \text{Tails} \\ \hline
\text{Heads} & 1 & -1 \\
\text{Tails} & -1 & 1
\end{array}
\]


Solution Observe that in this game no player has a dominated choice: Heads can be
better or worse than Tails, depending on the choice of the opponent. Also, there is
no saddlepoint: there is no position in the matrix at which there is simultaneously a
minimum in the row and a maximum in the column. Thus, there does not seem to be
a natural way to solve the game. One way to overcome this difficulty is by allowing
players to randomize between their choices: player 1 chooses Heads with a certain
probability p and Tails with probability 1 − p, and player 2 chooses Heads and
Tails with probabilities q and 1 − q respectively. From considerations of symmetry,
a good guess would be to suppose that player 1 chooses Heads or Tails both with
probability 1/2. As above, suppose that player 2 plays Heads with probability q and
Tails with probability 1 − q. In that case the expected payoff for player 1 is equal to

\[
\tfrac{1}{2}\,[\,q \cdot 1 + (1 - q) \cdot (-1)\,] + \tfrac{1}{2}\,[\,q \cdot (-1) + (1 - q) \cdot 1\,],
\]

which is independent of q, namely, equal to 0. So by randomizing in this way
between his two choices, player 1 can guarantee to obtain 0 in expectation (of
course, the actually realized outcome is always +1 or −1). Analogously, player
2, by playing Heads or Tails each with probability 1/2, can guarantee to pay 0 in
expectation. Thus, the amount of 0 plays a role similar to that of a saddlepoint.
Again, we will say that 0 is the value of this game.
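The claim that the expected payoff does not depend on q can be checked numerically. The sketch below is only an illustration; the function name expected_payoff and the grid of values for q are our own assumptions, and exact fractions are used to avoid rounding errors.

```python
from fractions import Fraction

# Matching Pennies: payoffs to player 1 (rows and columns ordered Heads, Tails).
A = [[1, -1],
     [-1, 1]]

def expected_payoff(matrix, p, q):
    """Expected payoff to player 1 in a 2x2 matrix game when player 1 plays
    the first row with probability p and player 2 plays the first column
    with probability q."""
    row_probs = [p, 1 - p]
    col_probs = [q, 1 - q]
    return sum(row_probs[i] * col_probs[j] * matrix[i][j]
               for i in range(2) for j in range(2))

half = Fraction(1, 2)
# With p = 1/2 the expected payoff equals 0 for every q on the grid 0, 1/10, ..., 1.
for q in [Fraction(k, 10) for k in range(11)]:
    print(q, expected_payoff(A, half, q))
```

For comparison, a player 1 who deviates from 1/2 can be exploited: with p = 3/4 and q = 0 the same function returns −1/2.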


Comments The randomized choices of the players are usually called mixed strategies.
Randomized choices are often interpreted as beliefs of the other player(s) about
the  choice  of  the  player  under  consideration.  See  also  Sect.  3.1. 
1.3.2 Nonzero-Sum Games

1.3.2.1 Prisoners’ Dilemma

Story  Two  prisoners  (players  1  and  2)  have  committed  a  crime  together  and  are 
interrogated  separately.  Each  prisoner  has  two  possible  choices:  he  may  ‘cooperate’ 
(C)  which  means  ‘not  betray  his  partner’  or  he  may  ‘defect’  (D),  which  means 
‘betray  his  partner’.  The  punishment  for  the  crime  is  10  years  of  prison.  Betrayal 
yields  a  reduction  of  1  year  for  the  defector  (traitor).  If  a  prisoner  is  not  betrayed, 
he  is  convicted  to  1  year  for  a  minor  offense. 

Model  This  situation  can  be  summarized  as  follows: 

\[
\begin{array}{c|cc}
 & C & D \\ \hline
C & -1,-1 & -10,0 \\
D & 0,-10 & -9,-9
\end{array}
\]

This  table  is  read  in  the  same  way  as  before,  but  now  there  are  two  payoffs  at  each 
position:  by  convention  the  first  number  is  the  payoff  for  player  1  (the  row  player) 
and  the  second  number  is  the  payoff  for  player  2  (the  column  player).  Observe  that 
the  game  is  no  longer  zero-sum,  and  we  have  to  write  down  both  numbers  at  each 
matrix  position. 

Solution  For  both  players  C  is  a  strictly  dominated  choice:  D  is  better  than  C, 
whatever  the  other  player  does.  So  it  is  natural  to  argue  that  the  outcome  of  this 
game will be the pair of choices (D, D), leading to the payoffs −9, −9. Thus, due to
the  existence  of  strictly  dominated  choices,  the  Prisoners’  Dilemma  game  is  easy  to 
analyze. 
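The dominance argument amounts to a small number of payoff comparisons. As an illustrative sketch (the function name strictly_dominates and the encoding of C and D as indices 0 and 1 are our own assumptions):

```python
# Prisoners' Dilemma: index 0 = C (cooperate), index 1 = D (defect).
# payoffs[i][j] = (payoff to player 1, payoff to player 2)
# when player 1 chooses row i and player 2 chooses column j.
payoffs = [[(-1, -1), (-10, 0)],
           [(0, -10), (-9, -9)]]

def strictly_dominates(payoffs, player, a, b):
    """True if choice a gives `player` (0 = row, 1 = column) a strictly higher
    payoff than choice b against every choice of the opponent."""
    n_opponent = len(payoffs[0]) if player == 0 else len(payoffs)
    for k in range(n_opponent):
        if player == 0:
            with_a, with_b = payoffs[a][k][0], payoffs[b][k][0]
        else:
            with_a, with_b = payoffs[k][a][1], payoffs[k][b][1]
        if with_a <= with_b:
            return False
    return True

print(strictly_dominates(payoffs, 0, 1, 0))  # True: D dominates C for player 1
print(strictly_dominates(payoffs, 1, 1, 0))  # True: D dominates C for player 2
```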

Comments The payoffs (−9, −9) are inferior: they are not Pareto optimal, since the
players could obtain the higher payoff of −1 for each by cooperating, i.e., both
playing  C.  There  is  a  large  literature  on  how  to  establish  cooperation,  e.g.  by 
reputation  effects  in  a  repeated  play  of  the  game.  If  the  game  is  played  repeatedly, 
other  (higher)  payoffs  are  possible,  see  Chap.  7. 

The  Prisoners’  Dilemma  is  a  metaphor  for  many  economic  situations.  An 
outstanding  example  is  the  so-called  ‘tragedy  of  the  commons’,  see  Problem  6.27  in 
this  book. 

1.3.2.2 Battle of the Sexes

Story  A  man  and  a  woman  want  to  go  out  together,  either  to  a  football  match  or  to 
a  ballet  performance.  They  forgot  to  agree  where  they  would  go  to  that  night,  are  in 
different  places  and  have  to  decide  on  their  own  where  to  go;  they  have  no  means 
to  communicate.  Their  main  concern  is  to  be  together.  The  man  has  a  preference  for 
football  and  the  woman  for  ballet. 
Model A table reflecting the situation is as follows.

\[
\begin{array}{c|cc}
 & \text{Football} & \text{Ballet} \\ \hline
\text{Football} & 2,1 & 0,0 \\
\text{Ballet} & 0,0 & 1,2
\end{array}
\]

Here,  the  man  chooses  a  row  and  the  woman  a  column. 

Solution  Observe  that  no  player  has  a  dominated  choice.  The  players  have  to 
coordinate  without  being  able  to  communicate.  Now  it  may  be  possible  that  the  night 
before  they  discussed  football  at  length;  each  player  remembers  this,  may  think  that 
the  other  remembers  this,  and  so  this  may  serve  as  a  focal  point  for  both.  In  the 
absence  of  such  considerations  it  is  hard  to  give  a  unique  prediction  for  this  game. 
We  can,  however,  say  that  the  combinations  (Football,  Football)  and  (Ballet,  Ballet) 
are  special  in  the  sense  that  the  players’  choices  are  best  replies  to  each  other;  if  the 
man  chooses  Football  (Ballet),  then  it  is  optimal  for  the  woman  to  choose  Football 
(Ballet)  as  well,  and  vice  versa.  In  the  literature,  such  choice  combinations  are  called 
Nash  equilibria.  The  concept  of  Nash  equilibrium  is  without  doubt  the  main  solution 
concept  developed  in  game  theory. 
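The mutual best reply property can be verified by checking every combination of a row and a column. The following sketch does this for the Battle of the Sexes; the function name pure_nash_equilibria is our own, and the code only considers choices without randomization.

```python
def pure_nash_equilibria(payoffs):
    """Return all (row, column) pairs at which each player's choice is a
    best reply to the choice of the other player."""
    rows, cols = len(payoffs), len(payoffs[0])
    equilibria = []
    for i in range(rows):
        for j in range(cols):
            # Best payoff the row player can get against column j.
            best_row = max(payoffs[r][j][0] for r in range(rows))
            # Best payoff the column player can get against row i.
            best_col = max(payoffs[i][c][1] for c in range(cols))
            if payoffs[i][j][0] == best_row and payoffs[i][j][1] == best_col:
                equilibria.append((i, j))
    return equilibria

# Index 0 = Football, index 1 = Ballet; the man chooses the row.
battle_of_the_sexes = [[(2, 1), (0, 0)],
                       [(0, 0), (1, 2)]]
print(pure_nash_equilibria(battle_of_the_sexes))  # [(0, 0), (1, 1)]
```

The two positions returned correspond to (Football, Football) and (Ballet, Ballet).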

Comments The Battle of the Sexes game is metaphoric for problems of coordination.

1.3.2.3 Matching Pennies

Every zero-sum game is, trivially, a special case of a nonzero-sum game. For
instance, the Matching Pennies game discussed in Sect. 1.3.1 can be represented
as a nonzero-sum game as follows:

\[
\begin{array}{c|cc}
 & \text{Heads} & \text{Tails} \\ \hline
\text{Heads} & 1,-1 & -1,1 \\
\text{Tails} & -1,1 & 1,-1
\end{array}
\]


Clearly, no player has a dominated choice and there is no combination of a row and
a column such that each player’s choice is optimal given the choice of the other
player: without randomization there is no Nash equilibrium. If mixed strategies are
allowed, then it can be checked that if player 2 plays Heads and Tails each with
probability 1/2, then for player 1 it is optimal to do so too, and vice versa. Such a
combination of mixed strategies is again called a Nash equilibrium. See Chaps. 3 and 13.

1.3.2.4 A Cournot Game

Story Two firms produce a similar (‘homogenous’) product. The market price of
this product is equal to p = 1 − Q or zero (whichever is larger), where Q is the total
quantity produced. There are no production costs.
Model The two firms are the players, 1 and 2. Each player i = 1, 2 chooses a
quantity q_i ≥ 0, and makes a profit of K_i(q_1, q_2) = q_i(1 − q_1 − q_2) (or zero if
q_1 + q_2 ≥ 1).

Solution Suppose player 2 produces q_2 = 1/3. Then player 1 maximizes his own
profit q_1(1 − q_1 − 1/3) by choosing q_1 = 1/3. Also the converse holds: if player 1
chooses q_1 = 1/3 then q_2 = 1/3 maximizes profit for player 2. This combination
of strategies consists of mutual best replies and is therefore again called a Nash
equilibrium.
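The best reply computation can be sketched in a few lines of Python. This is only an illustration: the function names profit and best_reply are our own, and the best reply is taken from the first-order condition 1 − 2q_1 − q_2 = 0 of the profit function (stated here for player 1; by symmetry the same formula holds for player 2).

```python
from fractions import Fraction

def profit(own, other):
    """Profit of a firm producing `own` when the other firm produces `other`:
    quantity times the market price max(1 - Q, 0), with zero production costs."""
    return own * max(1 - own - other, 0)

def best_reply(other):
    """Profit-maximizing quantity against `other`, from the first-order
    condition 1 - 2*own - other = 0 (and 0 if that would be negative)."""
    return max((1 - other) / 2, 0)

third = Fraction(1, 3)
print(best_reply(third))       # 1/3: producing 1/3 is a best reply to 1/3
print(profit(third, third))    # 1/9: profit of each firm in this combination

# Sanity check on a grid: no quantity k/100 does better against 1/3.
assert all(profit(Fraction(k, 100), third) <= profit(third, third)
           for k in range(101))
```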

Comments This particular Nash equilibrium is often called Cournot equilibrium. It
is easy to check that the Cournot equilibrium in this example is again not Pareto
optimal: if the firms each would produce 1/4 then they would both be better off. The
main difference between this example and the preceding ones is that each player
here has infinitely many choices, even if no mixed strategies are included. See further
Chap. 6.


1.3.3 Extensive Form Games

All  examples  in  Sects.  1.3.1  and  1.3.2  are  examples  of  one-shot  games:  the  players 
choose  only  once,  independently  and  simultaneously.  In  parlor  games  as  well  as 
in  games  derived  from  real-life  economic  or  political  situations,  this  is  often  not 
what  happens.  Players  may  move  sequentially,  and  observe  or  partially  observe 
each other’s moves. Such situations are better modelled by so-called extensive form
games. 

1.3.3.1 Sequential Battle of the Sexes

Story  The  story  is  similar  to  the  one  in  Sect.  1.3.2,  but  we  now  assume  that  the  man 
chooses  first  and  the  woman  can  observe  the  choice  of  the  man. 

Model  This  situation  can  be  represented  by  the  decision  tree  in  Fig.  1.1. 


Fig. 1.1 The decision tree of sequential battle of the sexes
Player 1 (the man) chooses first, player 2 (the woman) observes player 1’s choice
and  then  makes  her  own  choice.  The  first  number  in  each  pair  of  numbers  is  the 
payoff  to  player  1,  and  the  second  number  is  the  payoff  to  player  2.  Filled  circles 
denote  decision  nodes  (of  a  player)  or  end  nodes  (followed  by  payoffs). 

Solution  An  obvious  way  to  analyze  this  game  is  by  working  backwards.  If  player  1 
chooses  F,  then  it  is  optimal  for  player  2  to  choose  F  as  well,  and  if  player  1  chooses 
B, then it is optimal for player 2 to choose B as well. Given this choice behavior of
player  2  and  assuming  that  player  1  performs  this  line  of  reasoning  about  the  choices 
of  player  2,  player  1  should  choose  F. 
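The backward reasoning can be written out as a two-step computation. The sketch below is illustrative only; it assumes that the payoffs at the end nodes of the tree are those of the Battle of the Sexes table in Sect. 1.3.2, and the function name backward_induction is our own.

```python
# Sequential Battle of the Sexes.
# payoffs[(choice of player 1, choice of player 2)] = (payoff to 1, payoff to 2),
# assumed to be the same numbers as in the simultaneous Battle of the Sexes.
payoffs = {("F", "F"): (2, 1), ("F", "B"): (0, 0),
           ("B", "F"): (0, 0), ("B", "B"): (1, 2)}
choices = ["F", "B"]

def backward_induction(payoffs, choices):
    """Return player 2's plan (a reply to each choice of player 1) and
    player 1's optimal choice given that plan."""
    # Step 1: player 2 moves last and picks her best reply at each decision node.
    plan = {a1: max(choices, key=lambda a2: payoffs[(a1, a2)][1])
            for a1 in choices}
    # Step 2: player 1 anticipates this plan and picks his best choice.
    first = max(choices, key=lambda a1: payoffs[(a1, plan[a1])][0])
    return plan, first

plan, first = backward_induction(payoffs, choices)
print(plan, first)  # {'F': 'F', 'B': 'B'} F
```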

Comments  What  this  simple  example  shows  is  that  in  such  a  so-called  extensive 
form  game,  there  is  a  distinction  between  a  play  plan  of  a  player  and  an  actual  move 
or choice of that player. Player 2 has the plan to choose F (B) if player 1 has chosen
F (B). Player 2’s actual choice is F, assuming as above that player 1 has chosen
F.  We  use  the  word  strategy  to  denote  a  play  plan,  and  the  word  action  to  denote 
a  particular  move.  In  a  one-shot  game  there  is  no  difference  between  the  two,  and 
then  the  word  ‘strategy’  is  used. 

Games  in  extensive  form  are  studied  in  Chaps.  4,  5,  and  14.  The  solution 
described  above  is  an  example  of  a  so-called  backward  induction  (or  subgame 
perfect)  (Nash)  equilibrium.  There  are  other  Nash  equilibria  as  well.  Suppose  player 
1  chooses  B  and  player  2’s  plan  (strategy)  is  to  choose  B  always,  independent  of 
player 1’s choice. Observe that, given the strategy of the opponent, no player can
do better, and so this combination is a Nash equilibrium, although player 2’s plan is
only partly ‘credible’: if player 1 would choose F instead of B, then player 2 would
be  better  off  by  changing  her  choice  to  F. 

1.3.3.2 Sequential Cournot

Story The story is similar to the one in Sect. 1.3.2, but we now assume that firm 1
chooses first and firm 2 can observe the choice of firm 1.

Model Since each player i = 1, 2 has infinitely many actions q_i ≥ 0, we cannot
draw a picture like Fig. 1.1 for the sequential Battle of the Sexes. Instead of straight
lines we use zigzag lines to denote a continuum of possible actions. For this example
we obtain Fig. 1.2.

Player 1 moves first and chooses q_1 ≥ 0. Player 2 observes player 1’s choice of q_1
and then chooses q_2 ≥ 0.

Fig. 1.2 The extensive form of sequential Cournot: player 1 chooses q_1 ≥ 0, then player 2 chooses q_2 ≥ 0; payoffs q_1(1 − q_1 − q_2), q_2(1 − q_1 − q_2)
Solution As in the sequential Battle of the Sexes game, an obvious way to solve
this game is by working backwards. Given the observed choice q_1, player 2’s optimal
(profit maximizing) choice is q_2 = (1 − q_1)/2 or q_2 = 0, whichever is larger. Given
this reaction function of player 2, the optimal choice of player 1 is obtained by
maximizing the profit function q_1 ↦ q_1(1 − q_1 − (1 − q_1)/2). The maximum is
obtained for q_1 = 1/2. Consequently, player 2 chooses q_2 = 1/4.
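The two steps of this argument can be checked numerically. The following sketch is an illustration only: the function names reaction and leader_profit are our own, and the maximization over q_1 is replaced by an exact search on the grid 0, 1/100, ..., 1, which is an arbitrary choice that happens to contain the maximizer.

```python
from fractions import Fraction

def reaction(q1):
    """Player 2's profit-maximizing reply to an observed quantity q1."""
    return max((1 - q1) / 2, 0)

def leader_profit(q1):
    """Player 1's profit when player 2 replies according to `reaction`."""
    return q1 * max(1 - q1 - reaction(q1), 0)

# Exact search over a grid of quantities for player 1.
grid = [Fraction(k, 100) for k in range(101)]
q1_star = max(grid, key=leader_profit)

print(q1_star, reaction(q1_star))   # 1/2 1/4
print(leader_profit(q1_star))       # 1/8
```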

Comments  The  solution  described  here  is  another  example  of  a  backward  induction 
or  subgame  perfect  equilibrium.  It  is  also  called  Stackelberg  equilibrium.  See 
Chap.  6. 

1.3.3.3 Entry Deterrence

Story An old question in industrial organization is whether an incumbent monopolist
can maintain his position by threatening to start a price war against any new
firm that enters the market. In order to analyze this question, consider the following
situation. There are two players, the entrant and the incumbent. The entrant decides
whether to Enter (E) or to Stay Out (O). If the entrant enters, the incumbent can
Collude  (C)  with  him,  or  Fight  (F)  by  cutting  the  price  drastically.  The  payoffs  are 
as  follows.  Market  profits  are  100  at  the  monopoly  price  and  0  at  the  fighting  price. 
Entry  costs  10.  Collusion  shares  the  monopoly  profits  evenly. 

Model  This  situation  can  be  represented  by  the  decision  tree  in  Fig.  1.3. 

Solution  By  working  backward,  we  find  that  the  entrant  enters  and  the  incumbent 
colludes. 

Comments As in the sequential Battle of the Sexes, there exists another Nash
equilibrium. If the entrant stays out and the incumbent plans to fight in case the
entrant enters, then this is also a combination in which no player can do better given
the strategy of the other player. Again, one might argue that the incumbent’s ‘threat’
to start a price war if the potential entrant enters is not credible, since the
incumbent hurts himself by carrying out the threat.
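
Both the backward induction solution and the second Nash equilibrium can be found by brute force. The sketch below assumes the payoffs that follow from the story: (0, 100) if the entrant stays out, (40, 50) after entry and collusion, and (-10, 0) after entry and a price war. The dictionary layout and function names are illustrative.

```python
# Payoffs (entrant, incumbent), derived from the story: monopoly profit 100,
# fighting profit 0, entry cost 10, collusion splits the profit evenly.
PAYOFFS = {
    ("O", "C"): (0, 100), ("O", "F"): (0, 100),  # entrant stays out
    ("E", "C"): (40, 50),                         # 50 - 10 for the entrant
    ("E", "F"): (-10, 0),                         # 0 - 10 for the entrant
}

def backward_induction():
    """Incumbent picks the best reply to entry; the entrant anticipates it."""
    inc = max("CF", key=lambda b: PAYOFFS[("E", b)][1])
    ent = max("EO", key=lambda a: PAYOFFS[(a, inc)][0])
    return ent, inc

def nash_equilibria():
    """All strategy pairs in which neither player gains by deviating alone."""
    result = []
    for a in "EO":
        for b in "CF":
            ent_ok = all(PAYOFFS[(a, b)][0] >= PAYOFFS[(a2, b)][0] for a2 in "EO")
            inc_ok = all(PAYOFFS[(a, b)][1] >= PAYOFFS[(a, b2)][1] for b2 in "CF")
            if ent_ok and inc_ok:
                result.append((a, b))
    return result

print(backward_induction())
print(nash_equilibria())
```

The pair (O, F) passes the Nash test only because the incumbent’s plan to fight is never carried out when the entrant stays out, which is exactly the credibility issue raised above.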


[Fig. 1.3 The game of entry deterrence. Payoffs: entrant, incumbent. The entrant chooses
E or O; after E, the incumbent moves.]

1.3.3.4 Entry Deterrence with Incomplete Information

Story  Consider  the  following  variation  on  the  foregoing  entry  deterrence  model. 
Suppose  that  with  50  %  probability  the  incumbent’s  payoff  from  Fight  (F)  is  equal 
to  some  amount  v  rather  than  the  0  above.  (Here,  v  is  a  given  number.  It  is  not 
a  variable  but  a  parameter.)  The  entrant  still  moves  first  and  knows  whether  this 
payoff  is  v  or  0.  The  incumbent  moves  last  and  does  not  know  whether  this  payoff  is 
v  or  0  when  he  moves.  Both  firms  know  the  probabilities  of  the  payoff  being  v  or  0. 
This  situation  might  arise,  for  instance,  if  the  capacity  of  the  entrant  firm  is  private 
information.  A  positive  value  of  v  might  be  associated  with  the  entrant  having  a 
capacity  constraint  which  leaves  a  larger  share  of  the  market  to  the  incumbent  if  he 
fights.  The  incumbent  estimates  the  probability  of  the  entrant  having  this  capacity 
constraint  at  50  %,  and  the  entrant  knows  this. 

Model  This  situation  can  be  modelled  by  including  a  chance  move  in  the  game  tree. 
Moreover,  the  tree  should  express  the  asymmetric  information  between  the  players. 
Consider  the  game  tree  in  Fig.  1.4.  First  there  is  a  chance  move.  The  entrant  learns 
the  outcome  of  the  chance  move  and  decides  to  enter  or  not.  If  he  enters,  then  the 
incumbent  decides  to  collude  or  fight,  without  however  knowing  the  outcome  of  the 
chance  move:  this  is  indicated  by  the  dashed  line.  Put  otherwise,  the  incumbent  has 
two  decision  nodes  where  he  should  choose,  but  he  does  not  know  at  which  node  he 
actually  is.  Thus,  he  can  only  choose  between  ‘collude’  and  ‘fight’,  without  making 
this  choice  contingent  on  the  outcome  of  the  chance  move. 

Solution If $v \le 50$ then an obvious solution is that the incumbent colludes and
the entrant enters. Also the combination of strategies where the entrant stays out no
matter what the outcome of the chance move is, and the incumbent fights, is a Nash
equilibrium. A complete analysis is more subtle and may include a consideration of
the probabilistic information that the incumbent might derive from the action of the
entrant in a so-called perfect Bayesian equilibrium, see Chaps. 5 and 14.
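
The two equilibria mentioned here can be recovered by enumerating pure strategies. The sketch below rests on assumptions: the entrant’s payoff after Fight is taken to be -10 in both cases (only the incumbent’s payoff changes to $v$), equilibria are checked in terms of ex ante expected payoffs, and the values $v = 20$ and $v = 150$ are hypothetical. An entrant strategy is a pair of actions, one for each outcome of the chance move.

```python
from itertools import product

def expected_payoffs(entrant, incumbent, v):
    """entrant = (action if Fight pays v, action if Fight pays 0); incumbent in 'CF'."""
    total_e = total_i = 0.0
    for state, action in zip(("v", "0"), entrant):
        if action == "O":
            e, i = 0, 100
        elif incumbent == "C":
            e, i = 40, 50
        else:  # Fight; assumption: the entrant gets -10 in both states
            e, i = -10, (v if state == "v" else 0)
        total_e += 0.5 * e
        total_i += 0.5 * i
    return total_e, total_i

def pure_nash(v):
    """Pure strategy pairs from which no player gains by deviating alone."""
    ent_strats = list(product("EO", repeat=2))
    equilibria = []
    for s, b in product(ent_strats, "CF"):
        ue, ui = expected_payoffs(s, b, v)
        ent_ok = all(ue >= expected_payoffs(s2, b, v)[0] for s2 in ent_strats)
        inc_ok = all(ui >= expected_payoffs(s, b2, v)[1] for b2 in "CF")
        if ent_ok and inc_ok:
            equilibria.append(("".join(s), b))
    return equilibria

print(pure_nash(20))   # hypothetical small v
print(pure_nash(150))  # hypothetical large v
```

This enumeration ignores the beliefs of the incumbent at his information set, so it does not replace the perfect Bayesian analysis referred to above.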


[Fig. 1.4 Entry deterrence with incomplete information. A chance move selects each case
with probability 50%; the entrant then chooses E or O; after E the incumbent (Inc.)
moves without knowing the outcome of the chance move.]

[Fig. 1.5 Simultaneous battle of the sexes in extensive form]


Comments  The  collection  of  the  two  nodes  of  the  incumbent,  connected  by  the 
dashed  line,  is  called  an  information  set.  In  general,  information  sets  are  used  to 
model  imperfect  information.  In  the  present  example  imperfect  information  arises 
since  the  incumbent  does  not  know  the  outcome  of  the  chance  move.  Imperfect 
information  can  also  arise  if  some  player  does  not  observe  some  move  of  some 
other  player.  As  a  simple  example,  consider  again  the  simultaneous  move  Battle  of 
the  Sexes  game  of  Sect.  1.3.2.  This  can  be  modelled  as  a  game  in  extensive  form  as  in 
Fig.  1.5.  Hence,  player  2,  when  he  moves,  does  not  know  what  player  1  has  chosen. 
This  is  equivalent  to  players  1  and  2  moving  independently  and  simultaneously. 


1.3.4 Cooperative Games

In  a  cooperative  game  the  focus  is  on  payoffs  and  coalitions,  rather  than  on 
strategies.  The  prevailing  analyses  have  an  axiomatic  flavor,  in  contrast  to  the 
equilibrium  analysis  of  noncooperative  theory.  The  implicit  assumption  is  that 
players  can  make  binding  agreements. 

1.3.4.1 Three Cooperating Cities

Story  Cities  1,  2  and  3  want  to  be  connected  with  a  nearby  power  source.  The 
possible  transmission  links  and  their  costs  are  shown  in  the  following  figure.  Each 
city  can  hire  any  of  the  transmission  links.  If  the  cities  cooperate  in  hiring  the  links 
they  save  on  the  hiring  costs  (the  links  have  unlimited  capacity).  The  situation  is 
represented  in  Fig.  1.6. 

Model The players in this situation are the three cities. Denote the player set by
$N = \{1, 2, 3\}$. These players can form coalitions: any subset $S$ of $N$ is called a
coalition. Table 1.1 presents the costs as well as the savings of each coalition.


[Fig. 1.6 Situation leading to the three cities game]


Table 1.1 The three cities game

S      {1}   {2}   {3}   {1,2}   {1,3}   {2,3}   {1,2,3}
c(S)   100   140   130   150     130     150     150
v(S)   0     0     0     90      100     120     220

The costs $c(S)$ are obtained by calculating the cheapest routes connecting
the cities in the coalition $S$ with the power source. The cost savings $v(S)$ are
determined by

$$v(S) := \sum_{i \in S} c(\{i\}) - c(S) \quad \text{for each nonempty } S \subseteq N.$$

The cost savings $v(S)$ for coalition $S$ are equal to the difference in costs corresponding
to the situation where all members of $S$ work alone and the situation where all
members of $S$ work together. The pair $(N, v)$ is called a cooperative game.
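
The savings row of Table 1.1 follows mechanically from the cost row. The short sketch below recomputes it, with the costs keyed by coalition; the names `COST` and `savings` are illustrative.

```python
from itertools import combinations

# Costs c(S) from Table 1.1, keyed by the coalition as a frozenset of cities.
COST = {
    frozenset({1}): 100, frozenset({2}): 140, frozenset({3}): 130,
    frozenset({1, 2}): 150, frozenset({1, 3}): 130, frozenset({2, 3}): 150,
    frozenset({1, 2, 3}): 150,
}

def savings(coalition):
    """v(S) = sum of the stand-alone costs of the members minus the joint cost c(S)."""
    s = frozenset(coalition)
    return sum(COST[frozenset({i})] for i in s) - COST[s]

V = {s: savings(s) for s in COST}
for size in (1, 2, 3):
    for s in combinations((1, 2, 3), size):
        print(set(s), V[frozenset(s)])
```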

Solution Basic questions in a cooperative game $(N, v)$ are: which coalitions will
actually be formed, and how should the worth (savings) of such a coalition be
distributed among its members? To form a coalition the consent of every member is
needed, but it is likely that the willingness of a player to participate in a coalition
depends on what that player obtains in that coalition. Therefore, the second question
seems to be the more basic one, and in this book attention is focussed on that
question. Specifically, it is usually assumed that the grand coalition $N$ of all players
is formed, and the question is then reduced to the problem of distributing the
amount $v(N)$ among the players. In the present example, how should the amount 220
($= v(N)$) be distributed among the three cities? In other words, we look for vectors
$x = (x_1, x_2, x_3) \in \mathbb{R}^3$ such that $x_1 + x_2 + x_3 = 220$, where player $i \in \{1, 2, 3\}$
obtains $x_i$. One obvious candidate is to choose $x_1 = x_2 = x_3 = 220/3$, but this
does not really reflect the asymmetry of the situation: some coalitions save more
than others. The literature offers many quite different solutions to this distribution
problem, among which are the core, the Shapley value, and the nucleolus.

The core consists of those payoff distributions that cannot be improved upon by
any smaller coalition. For the three cities example, this means that the core consists
of those vectors $(x_1, x_2, x_3)$ such that $x_1 + x_2 + x_3 = 220$, $x_1, x_2, x_3 \ge 0$, $x_1 + x_2 \ge 90$,
$x_1 + x_3 \ge 100$, and $x_2 + x_3 \ge 120$. Hence, this is a large set and therefore it is rather
indeterminate as a solution to the game.
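
Core membership amounts to checking one equality and a handful of inequalities. The following sketch does this for the three cities game; the three candidate vectors are hypothetical examples chosen for illustration, and the tolerance `tol` only guards against floating point rounding.

```python
from itertools import combinations

# Savings v(S) of the three cities game (Table 1.1).
V = {
    frozenset({1}): 0, frozenset({2}): 0, frozenset({3}): 0,
    frozenset({1, 2}): 90, frozenset({1, 3}): 100, frozenset({2, 3}): 120,
    frozenset({1, 2, 3}): 220,
}

def in_core(x, v=V, tol=1e-9):
    """x maps each player to a payoff; check efficiency and all coalition constraints."""
    players = sorted(x)
    if abs(sum(x.values()) - v[frozenset(players)]) > tol:
        return False  # the grand coalition's worth is not distributed exactly
    for size in range(1, len(players)):
        for s in combinations(players, size):
            if sum(x[i] for i in s) < v[frozenset(s)] - tol:
                return False  # coalition s can improve upon x
    return True

print(in_core({1: 220 / 3, 2: 220 / 3, 3: 220 / 3}))  # equal split
print(in_core({1: 0, 2: 90, 3: 130}))
print(in_core({1: 150, 2: 50, 3: 20}))                # violates x2 + x3 >= 120
```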

In contrast, the Shapley value consists by definition of one point (vector).
Roughly, according to the Shapley value, each player receives his average contribution
to the worth (savings) of coalitions. More precisely, imagine the players
entering the ‘bargaining room’ one at a time, say first player 1, then player 2, and
finally player 3. When player 1 enters, he forms a coalition on his own, which
has worth 0. When player 2 enters, they form the coalition $\{1, 2\}$, so that the
contribution of player 2 is equal to $v(\{1, 2\}) - v(\{1\}) = 90 - 0 = 90$. Finally,
player 3 enters and they form the grand coalition. Player 3’s contribution is equal
to $v(N) - v(\{1, 2\}) = 220 - 90 = 130$. Hence, this results in the payoff vector
$(0, 90, 130)$. Now the Shapley value is obtained by repeating this argument for the
five other possible orderings in which the players can enter the bargaining room,
and then taking the average of the six payoff vectors. In this example this results in
the distribution $(65, 75, 80)$.
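
The ‘bargaining room’ procedure translates directly into a loop over all orderings. The sketch below is one straightforward way to do this for small games; it enumerates all $n!$ orderings, so it is meant for illustration rather than for large player sets. The names `shapley` and `WORTH` are illustrative.

```python
from fractions import Fraction
from itertools import permutations

def shapley(players, v):
    """Average marginal contribution over all orderings; v maps a frozenset to its worth."""
    totals = {i: Fraction(0) for i in players}
    orders = list(permutations(players))
    for order in orders:
        entered = frozenset()
        for i in order:
            totals[i] += v(entered | {i}) - v(entered)
            entered = entered | {i}
    return {i: totals[i] / len(orders) for i in players}

# Savings of the three cities game (Table 1.1); the empty coalition has worth 0.
WORTH = {
    frozenset(): 0,
    frozenset({1}): 0, frozenset({2}): 0, frozenset({3}): 0,
    frozenset({1, 2}): 90, frozenset({1, 3}): 100, frozenset({2, 3}): 120,
    frozenset({1, 2, 3}): 220,
}

value = shapley((1, 2, 3), WORTH.__getitem__)
print(value)
```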

Also the nucleolus consists of one point, in this case the vector $(56\tfrac{2}{3}, 76\tfrac{2}{3}, 86\tfrac{2}{3})$.
The nucleolus is more complicated to define and harder to compute, and at this stage
the reader should take these numbers for granted.

Formal  definitions  of  all  these  concepts  are  provided  in  Chap.  9.  See  also 
Chaps.  16-20. 

Comments The implicit assumptions for a game like this are, first, that a coalition
which is actually formed can make binding agreements on the distribution of its
payoff and, second, that any payoff distribution which distributes (or, at least, does
not exceed) the savings (or, more generally, worth) of the coalition is possible. For
these reasons, such games are called cooperative games with transferable utility.

1.3.4.2 The Glove Game

Story Assume that there are three players, 1, 2, and 3. Players 1 and 2 each possess
a right-hand glove, while player 3 has a left-hand glove. A pair of gloves has worth
1. The players cooperate in order to generate a profit.

Model  The  associated  cooperative  game  is  described  by  Table  1.2. 

Solution The core of this game consists of exactly one vector. The Shapley value
assigns 2/3 to player 3 and 1/6 to both player 1 and player 2 (see Problem 1.6). The
nucleolus is the unique element of the core.
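
The Shapley value stated here can be reproduced with the subset form of the definition, in which the marginal contribution of player $i$ to a coalition $S$ not containing $i$ receives weight $|S|!\,(n - |S| - 1)!/n!$. This is equivalent to averaging over orderings. The sketch below uses the worths of Table 1.2; the function name is illustrative.

```python
from fractions import Fraction
from itertools import combinations
from math import factorial

# Worths of the glove game (Table 1.2); the empty coalition has worth 0.
GLOVE = {
    frozenset(): 0,
    frozenset({1}): 0, frozenset({2}): 0, frozenset({3}): 0,
    frozenset({1, 2}): 0, frozenset({1, 3}): 1, frozenset({2, 3}): 1,
    frozenset({1, 2, 3}): 1,
}

def shapley_by_subsets(players, v):
    """Weighted sum of marginal contributions over coalitions not containing the player."""
    n = len(players)
    value = {}
    for i in players:
        others = [j for j in players if j != i]
        total = Fraction(0)
        for size in range(n):
            weight = Fraction(factorial(size) * factorial(n - size - 1), factorial(n))
            for s in combinations(others, size):
                s = frozenset(s)
                total += weight * (v[s | {i}] - v[s])
        value[i] = total
    return value

glove_value = shapley_by_subsets((1, 2, 3), GLOVE)
print(glove_value)
```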


Table 1.2 The glove game

S      {1}   {2}   {3}   {1,2}   {1,3}   {2,3}   {1,2,3}
v(S)   0     0     0     0       1       1       1


Table 1.3 Preferences for dentist appointments

         Mon   Tue   Wed
Adams    2     4     8
Benson   10    5     2
Cooper   10    6     4


Table 1.4 The dentist game: a permutation game

S      {1}   {2}   {3}   {1,2}   {1,3}   {2,3}   {1,2,3}
v(S)   2     5     4     14      18      9       24

1.3.4.3 A Permutation Game

Story Mr. Adams, Mrs. Benson, and Mr. Cooper have appointments with the dentist
on Monday, Tuesday, and Wednesday, respectively. This schedule does not necessarily
match their preferences, due to different urgencies and other factors. These
preferences (expressed in numbers) are given in Table 1.3.

Model  This  situation  gives  rise  to  a  game  in  which  the  coalitions  can  gain  by 
reshuffling  their  appointments.  For  instance,  Adams  (player  1)  and  Benson  (player 
2)  can  change  their  appointments  and  obtain  a  total  of  14  instead  of  7.  A  complete 
description  of  the  resulting  game  is  given  in  Table  1.4. 
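
The worths in Table 1.4 can be derived from Table 1.3 by letting each coalition try every reshuffling of the appointments its own members hold. The sketch below does this by brute force; `PREF` and `worth` are illustrative names, and players 1, 2, 3 stand for Adams, Benson, and Cooper.

```python
from itertools import combinations, permutations

# PREF[player] = preferences for (Mon, Tue, Wed) from Table 1.3.
# Player k initially holds day index k - 1.
PREF = {1: (2, 4, 8), 2: (10, 5, 2), 3: (10, 6, 4)}

def worth(coalition):
    """Best total preference the coalition can reach by reshuffling its own slots."""
    members = sorted(coalition)
    slots = [m - 1 for m in members]  # day indices owned by the members
    return max(
        sum(PREF[m][d] for m, d in zip(members, perm))
        for perm in permutations(slots)
    )

for size in (1, 2, 3):
    for s in combinations((1, 2, 3), size):
        print(s, worth(s))
```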

Solution The core of this game is the convex hull of the vectors $(15, 5, 4)$, $(14, 6, 4)$,
$(8, 6, 10)$, and $(9, 5, 10)$, i.e., it is the quadrangle with these points as vertices,
together with its inside. The Shapley value is the vector $(9\tfrac{1}{2}, 6\tfrac{1}{2}, 8)$, and the
nucleolus is the vector $(11\tfrac{1}{2}, 5\tfrac{1}{2}, 7)$.

Comments  See  Chap.  20  for  an  analysis  of  permutation  games. 

1.3.4.4 A Voting Game

The United Nations Security Council consists of five permanent members (United
States, Russia, Britain, France, and China) and ten other members. Motions must
be approved by at least nine members, including all the permanent members. This
situation gives rise to a 15-player so-called voting game $(N, v)$ with $v(S) = 1$ if
the coalition $S$ contains the five permanent members and at least four nonpermanent
members, and $v(S) = 0$ otherwise. Games with worths 0 or 1 are also called simple
games. Coalitions with worth equal to 1 are called winning, the other coalitions are
called losing. Simple games are studied in Chap. 16.

A  solution  to  such  a  voting  game  is  interpreted  as  representing  the  power  of  a 
player,  rather  than  payoff  (money)  or  utility. 
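
As one possible illustration of such a power measure, the sketch below applies the Shapley value to this voting game. Choosing the Shapley value here is an assumption made for the example; the text does not single out a particular solution. Because the game depends only on how many permanent and nonpermanent members a coalition contains, the computation counts coalitions by size instead of listing all $2^{15}$ subsets.

```python
from fractions import Fraction
from math import comb, factorial

PERMANENT, OTHERS, QUOTA = 5, 10, 9
N = PERMANENT + OTHERS

def is_winning(n_permanent, n_others):
    """v(S) = 1 iff S has all permanent members and at least QUOTA members in total."""
    return n_permanent == PERMANENT and n_permanent + n_others >= QUOTA

def shapley_nonpermanent():
    """Weighted count of the coalitions that a given nonpermanent member turns winning."""
    total = Fraction(0)
    for k in range(OTHERS):              # k other nonpermanent members in S
        for p in range(PERMANENT + 1):   # p permanent members in S
            if is_winning(p, k + 1) and not is_winning(p, k):
                size = p + k
                weight = Fraction(factorial(size) * factorial(N - size - 1), factorial(N))
                total += comb(PERMANENT, p) * comb(OTHERS - 1, k) * weight
    return total

phi_other = shapley_nonpermanent()
# The values sum to v(N) = 1 and members of the same type receive the same value.
phi_permanent = (1 - OTHERS * phi_other) / PERMANENT
print(phi_other, phi_permanent)
```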


3 The numbers in Table 1.4 are the total payoffs to coalitions and not the net payoffs compared to
the coalition members staying alone instead of cooperating. These would be, respectively, 0, 0, 0,
7, 12, 0, and 13.

1.3.5 Bargaining Games

Bargaining  theory  focusses  on  agreements  between  individual  players. 

1.3.5.1 A Division Problem

Story Consider the following situation. Two players have to agree on the division of
one unit of a perfectly divisible good, say a liter of wine. If they reach an agreement,
say $(\alpha, \beta)$ where $\alpha, \beta \ge 0$ and $\alpha + \beta \le 1$, then they split up the one unit according to
this agreement; otherwise, they both receive nothing. The players have preferences
for the good, described by utility functions.

Model To fix ideas, assume that player 1 has a utility function $u_1(\alpha) = \alpha$ and player
2 has a utility function $u_2(\alpha) = \sqrt{\alpha}$. Thus, a distribution $(\alpha, 1 - \alpha)$ of the good leads
to a corresponding pair of utilities $(u_1(\alpha), u_2(1 - \alpha)) = (\alpha, \sqrt{1 - \alpha})$. By letting $\alpha$
range from 0 to 1 we obtain all utility pairs corresponding to all distributions of
the whole unit of the good: this is the bold curve in Fig. 1.7. It is assumed that also
distributions summing to less than the whole unit are theoretically possible. This
yields the whole shaded region.

Solution  According  to  the  Nash  bargaining  solution  this  bargaining  problem 
should  be  solved  as  follows:  maximize  the  product  of  the  players’  utilities  on  the 
shaded  area.  Since  this  maximum  will  be  reached  on  the  boundary,  the  problem  is 
equivalent  to 


$$\max_{0 \le \alpha \le 1} \alpha \sqrt{1 - \alpha}.$$


The maximum is obtained for $\alpha = \frac{2}{3}$. So the solution of the bargaining problem in
utilities equals $(\frac{2}{3}, \frac{1}{3}\sqrt{3})$, which is the point $z$ in Fig. 1.7. This implies that player
1 obtains $\frac{2}{3}$ of the 1 unit of the good, whereas player 2 obtains $\frac{1}{3}$. As described
here, this solution may seem to come out of the blue, but it can be backed up
axiomatically, see below.

[Fig. 1.7 A bargaining game. Axes: $u_1$ and $u_2$.]
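
The maximization of the product of utilities can be checked numerically. The sketch below uses a ternary search, which is applicable because $\alpha\sqrt{1 - \alpha}$ first increases and then decreases on $[0, 1]$ (its derivative is $(2 - 3\alpha)/(2\sqrt{1 - \alpha})$). The search routine and the number of iterations are illustrative choices.

```python
from math import sqrt

def nash_product(a):
    """Product of utilities u1(a) * u2(1 - a) = a * sqrt(1 - a)."""
    return a * sqrt(1.0 - a)

def ternary_max(f, lo, hi, iters=200):
    """Locate the maximizer of a function that rises and then falls on [lo, hi]."""
    for _ in range(iters):
        m1 = lo + (hi - lo) / 3
        m2 = hi - (hi - lo) / 3
        if f(m1) < f(m2):
            lo = m1
        else:
            hi = m2
    return (lo + hi) / 2

alpha = ternary_max(nash_product, 0.0, 1.0)
utilities = (alpha, sqrt(1.0 - alpha))
print(alpha, utilities)
```

The result agrees with the first-order condition $2 - 3\alpha = 0$ up to numerical precision.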

Comments  For  an  axiomatic  characterization  of  the  Nash  bargaining  solution,  see 
Chaps.  10  and  21.  The  bargaining  literature  also  includes  many  noncooperative, 
strategic  approaches  to  the  bargaining  problem,  notably  the  Rubinstein  alternating 
offers  model:  see  Chaps.  6  and  21.  The  bargaining  game  can  be  seen  as  a  special 
case  of  a  cooperative  game  without  transferable  utility.  Also  games  with  transferable 
utility  form  a  subset  of  the  more  general  class  of  games  without  transferable  utility. 
See  Chap.  21. 


1.4 Cooperative Versus Noncooperative Game Theory

The  usual  distinction  between  cooperative  and  noncooperative  game  theory  is  that 
in  a  cooperative  game  binding  agreements  between  players  are  possible,  whereas 
this  is  not  the  case  in  noncooperative  games.  This  distinction  is  informal  and 
also  not  very  clear-cut:  for  instance,  the  core  of  a  cooperative  game  has  a  clear 
noncooperative  flavor;  a  concept  such  as  correlated  equilibrium  for  noncooperative 
games (see Sect. 13.7) has a clear cooperative flavor. Moreover, a considerable part
of the game-theoretic literature is concerned with viewing problems both from a
cooperative and a noncooperative perspective. This approach is sometimes called the
Nash program; the bargaining problem discussed above is a typical example.
In  a  much  more  precise  sense,  the  theory  of  implementation  is  concerned  with 
representing  outcomes  from  cooperative  solutions  as  equilibrium  outcomes  of 
specific  noncooperative  solutions. 

A  workable  distinction  between  cooperative  and  noncooperative  games  can  be 
based  on  the  ‘modelling  technique’  that  is  used:  in  a  noncooperative  game  players 
have  explicit  strategies,  whereas  in  a  cooperative  game  players  and  coalitions  are 
characterized,  more  abstractly,  by  the  outcomes  and  payoffs  that  they  can  reach. 
The examples in Sects. 1.3.1-1.3.3 are examples of noncooperative games, whereas
those  in  Sects.  1.3.4  and  1.3.5  are  examples  of  cooperative  games. 


1.5  Problems 

1.1.  Battle  of  the  Bismarck  Sea 

(a)  Represent  the  ‘Battle  of  the  Bismarck  Sea’  as  a  game  in  extensive  form. 

(b)  Now  assume  that  Imamura  moves  first,  and  Kenney  observes  Imamura’s  move 
and  moves  next.  Represent  this  situation  in  extensive  form  and  solve  by  working 
backwards. 

(c) Answer the same questions as under (b) with now Kenney moving first.

1.2. Variant of Matching Pennies

Consider the following variant of the ‘Matching Pennies’ game:

          Heads   Tails
Heads   (   x      -1   )
Tails   (  -1       1   )

where $x$ is a real number. For each value of $x$, determine all saddlepoints of the
game, if any.


1.3.  Mixed  Strategies 

Consider  the  following  zero-sum  game: 


L  R 


(a)  Show  that  this  game  has  no  saddlepoint. 

(b)  Find  a  mixed  strategy  (randomized  choice)  of  (the  row)  player  1  that  makes  his 
expected  payoff  independent  of  player  2’s  strategy. 

(c) Find a mixed strategy of player 2 that makes his expected payoff independent of
player 1’s strategy.

(d)  Consider  the  expected  payoffs  found  under  (b)  and  (c).  What  do  you  conclude 
about  how  the  game  could  be  played  if  randomized  choices  are  allowed? 

1.4.  Sequential  Cournot 

Consider  the  sequential  Cournot  model  in  Sect.  1.3.3,  but  now  based  on  the  market 
price  p  =  2  —  3Q  (or  zero,  whichever  is  larger),  where  Q  is  the  total  quantity 
produced. 

(a) Represent this game in extensive form, similarly to Fig. 1.2.

(b)  Solve  this  game  by  working  backwards. 

1.5.  Three  Cooperating  Cities 

(a)  Complete  the  computation  of  the  Shapley  value  of  the  ‘Three  Cooperating  Cities 
Game’.  Is  it  an  element  of  the  core?  Why  or  why  not? 

(b)  Show  that  the  nucleolus  of  this  game  is  an  element  of  the  core. 

1.6.  Glove  Game 

(a)  Compute  the  core  of  the  glove  game. 

(b)  Compute  the  Shapley  value  of  this  game.  Is  it  an  element  of  the  core? 


1.7. Dentist Appointments

(a)  For  the  permutation  (dentist  appointments)  game,  compute  the  Shapley  value 
and  check  if  it  is  an  element  of  the  core. 

(b)  Show  that  the  nucleolus  of  this  game  is  in  the  core. 

1.8.  Nash  Bargaining 

(a)  Verify  the  computation  of  the  Nash  bargaining  solution  for  the  division  problem 
in  Sect.  1.3.5. 

(b) Compute the Nash bargaining outcome, both in utilities and in division of the
good, when the utility functions are $u_1(\alpha) = 2\alpha - \alpha^2$ and $u_2(\alpha) = \alpha$.

1.9.  Variant  of  Glove  Game 

Suppose there are $n = l + r$ players, where $l$ players own a left-hand glove and $r$
players own a right-hand glove. Let $N$ be the set of all players, let $L$ be the subset of
$N$ consisting of the players who own a left-hand glove, and let $R$ be the subset of $N$
consisting of the players who own a right-hand glove. Let $S \subseteq N$ denote an arbitrary
coalition and let $|S|$ denote the number of players in $S$. For instance, $|L| = l$, and
$|R \cap S|$ is the number of players in $S$ who own a right-hand glove. As before, each
pair of gloves has worth 1. Find an expression for $v(S)$, i.e., the maximal profit that
$S$ can generate by cooperation of its members.


1.6 Notes

The  ‘Battle  of  the  Bismarck  Sea’  example  is  taken  from  Rasmusen  (1989).  Also 
see  the  memoirs  of  Churchill  (1983):  in  1953,  Churchill  received  the  Nobel  prize  in 
literature  for  this  work. 

Von  Neumann  (1928)  proved  that  every  two-person  finite  zero-sum  game  in 
which  the  players  can  use  mixed  strategies,  has  a  value.  This  result  is  known  as 
the  minimax  theorem. 

The  Prisoners’  Dilemma  game  has  been  widely  studied  in  the  literature.  Axelrod 
(1984)  described  the  results  of  a  tournament  for  which  players  could  submit 
strategies  for  repeated  play  of  the  Prisoners’  Dilemma:  the  so-called  tit-for-tat 
strategy  emerged  as  a  winning  strategy.  For  the  ‘tragedy  of  the  commons’,  see 
Hardin  (1968),  or  Gibbons  (1992,  p.  27). 

The concept of focal points was developed by Schelling (1960). Nash equilibria
were first explicitly proposed by Nash (1951), who proved that every game in which
each player has finitely many choices (zero-sum or nonzero-sum) has a Nash
equilibrium in mixed strategies. The basic idea of a Nash equilibrium is much older.
For instance, the Cournot equilibrium was developed in Cournot (1838).

Subgame perfect equilibria in extensive form games were first explicitly studied
by Selten (1965). Again, the basic idea occurs earlier, for instance in von Stackelberg
(1934).

The Three Cooperating Cities is an example of a minimum cost spanning tree
game, see Bird (1976). The core was first introduced in Gillies (1953), the Shapley
value in Shapley (1953), and the nucleolus in Schmeidler (1969). The dentist
example is taken from Curiel (1997, p. 54). The United Nations Security Council
game is taken from Owen (1995).

The  Nash  bargaining  solution  was  proposed  and  axiomatically  characterized  in 
Nash  (1950).  Nash  (1953)  proposed  a  noncooperative  game  to  back  up  the  Nash 
bargaining  solution.  Rubinstein  (1982)  modelled  the  bargaining  problem  as  an 
alternating  offers  extensive  form  game.  Binmore  et  al.  (1986)  observed  the  close 
relationship  between  the  Nash  bargaining  solution  and  the  strategic  approach  of 
Rubinstein.  See  Chap.  10. 

The reference list contains a number of textbooks on game theory, notably:
Fudenberg and Tirole (1991a), Gardner (1995), Gibbons (1992), Maschler et al.
(2013), Morris (1994), Moulin (1988), Moulin (1995), Myerson (1991), Osborne
(2004), Owen (1995), Peleg and Sudhölter (2003), Perea (2012), Rasmusen (1989),
Thomas (1986), and Watson (2002).


Part I Thinking Strategically


2 Finite Two-Person Zero-Sum Games


This chapter deals with two-player games in which each player chooses from finitely
many pure strategies or randomizes among these strategies, and the sum of the
players’ payoffs or expected payoffs is always equal to zero. Games like the Battle
of the Bismarck Sea and Matching Pennies, discussed in Sect. 1.3.1, belong to this
class.

In Sect. 2.1 the basic definitions and theory are discussed. Section 2.2 shows how
to solve $2 \times n$ and $m \times 2$ games, and larger games by elimination of strictly dominated
strategies.


2.1  Basic  Definitions  and  Theory 

Since all data of a finite two-person zero-sum game can be summarized in one
matrix, such a game is usually called a ‘matrix game’.

Definition 2.1 (Matrix Game) A matrix game is an $m \times n$ matrix $A$ of real numbers,
where $m$ is the number of rows and $n$ is the number of columns. A (mixed) strategy
of player 1 is a probability distribution $\mathbf{p}$ over the rows of $A$, i.e., an element of the
set

$$\Delta^m := \Big\{\mathbf{p} = (p_1, \ldots, p_m) \in \mathbb{R}^m \;\Big|\; \sum_{i=1}^m p_i = 1,\ p_i \ge 0 \text{ for all } i = 1, \ldots, m\Big\}.$$

Similarly, a (mixed) strategy of player 2 is a probability distribution $\mathbf{q}$ over the
columns of $A$, i.e., an element of the set

$$\Delta^n := \Big\{\mathbf{q} = (q_1, \ldots, q_n) \in \mathbb{R}^n \;\Big|\; \sum_{j=1}^n q_j = 1,\ q_j \ge 0 \text{ for all } j = 1, \ldots, n\Big\}.$$

A strategy $\mathbf{p}$ of player 1 is called pure if there is a row $i$ with $p_i = 1$. This strategy
is also denoted by $\mathbf{e}^i$. Similarly, a strategy $\mathbf{q}$ of player 2 is called pure if there is a
column $j$ with $q_j = 1$. This strategy is also denoted by $\mathbf{e}^j$. □

The interpretation of a matrix game $A$ is as follows. If player 1 plays row $i$ (i.e., pure
strategy $\mathbf{e}^i$) and player 2 plays column $j$ (i.e., pure strategy $\mathbf{e}^j$), then player 1 receives
payoff $a_{ij}$ and player 2 pays $a_{ij}$ (and, thus, receives $-a_{ij}$), where $a_{ij}$ is the number
in row $i$ and column $j$ of matrix $A$. If player 1 plays strategy $\mathbf{p}$ and player 2 plays
strategy $\mathbf{q}$, then player 1 receives the expected payoff

$$\mathbf{p}A\mathbf{q} = \sum_{i=1}^m \sum_{j=1}^n p_i q_j a_{ij},$$

and player 2 receives $-\mathbf{p}A\mathbf{q}$.
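
The expected payoff is a plain double sum and is easy to evaluate. In the sketch below the $2 \times 2$ matrix and the two mixed strategies are hypothetical and serve only to illustrate the formula.

```python
from fractions import Fraction

def expected_payoff(p, A, q):
    """Return pAq = sum_i sum_j p_i * q_j * a_ij, the expected payoff to player 1."""
    return sum(
        p[i] * q[j] * A[i][j]
        for i in range(len(A))
        for j in range(len(A[0]))
    )

# Hypothetical matrix game and strategies, not taken from the text.
A = [[3, -1],
     [0, 2]]
p = (Fraction(1, 2), Fraction(1, 2))
q = (Fraction(1, 4), Fraction(3, 4))

payoff_1 = expected_payoff(p, A, q)
payoff_2 = -payoff_1                        # the game is zero-sum
pure = expected_payoff((1, 0), A, (0, 1))   # row 1 against column 2 gives a_12
print(payoff_1, payoff_2, pure)
```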

Remark 2.2 (1) Note that, according to Definition 2.1, a strategy means a mixed
strategy. A pure strategy is a special case of a mixed strategy. We add the adjective
‘pure’ if we wish to refer to a pure strategy. (2) Since no confusion is likely to arise,
we do not use transpose notations like $\mathbf{p}^T A \mathbf{q}$ or $\mathbf{p} A \mathbf{q}^T$. □

For  ‘solving’  matrix  games,  i.e.,  establishing  what  clever  players  would  or  should 
do,  the  concepts  of  maximin  and  minimax  strategies  are  important,  as  will  be 
explained  below.  First  we  give  the  definitions. 

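Definition 2.3 (Maximin and Minimax Strategies) A strategy $\mathbf{p}$ is a maximin
strategy of player 1 in matrix game $A$ if

$$\min\{\mathbf{p}A\mathbf{q} \mid \mathbf{q} \in \Delta^n\} \ge \min\{\mathbf{p}'A\mathbf{q} \mid \mathbf{q} \in \Delta^n\} \quad \text{for all } \mathbf{p}' \in \Delta^m.$$

A strategy $\mathbf{q}$ is a minimax strategy of player 2 in matrix game $A$ if

$$\max\{\mathbf{p}A\mathbf{q} \mid \mathbf{p} \in \Delta^m\} \le \max\{\mathbf{p}A\mathbf{q}' \mid \mathbf{p} \in \Delta^m\} \quad \text{for all } \mathbf{q}' \in \Delta^n.$$

□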

In words: a maximin strategy of player 1 maximizes the minimal (with respect
to player 2’s strategies) payoff of player 1, and a minimax strategy of player 2
minimizes the maximum (with respect to player 1’s strategies) that player 2 has
to pay to player 1. (It can be proved by basic mathematical analysis that maximin
and minimax strategies always exist.) Of course, the asymmetry in these definitions
is caused by the fact that, by convention, a matrix game represents the amounts that
player 2 has to pay to player 1.

In order to check if a strategy $\mathbf{p}$ of player 1 is a maximin strategy it is sufficient to
check that the first inequality in Definition 2.3 holds with $\mathbf{e}^j$ for every $j = 1, \ldots, n$
instead of every $\mathbf{q} \in \Delta^n$. This is not difficult to see but the reader is referred to
Chap. 12 for a formal treatment. A similar observation holds for minimax strategies.
In other words, to check if a strategy is maximin (minimax) it is sufficient to consider
its performance against every pure strategy, i.e., column (row).
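
The reason is that $\mathbf{p}A\mathbf{q} = \sum_j q_j\, \mathbf{p}A\mathbf{e}^j$ is a weighted average of the payoffs against the columns, so it can never fall below the smallest of them. The sketch below computes this smallest column payoff and spot-checks the claim against randomly drawn mixed strategies of player 2. The matrix, the strategy, and the random sampling are hypothetical choices made for illustration.

```python
from fractions import Fraction
import random

def payoff(p, A, q):
    """Expected payoff pAq to player 1."""
    return sum(p[i] * q[j] * A[i][j] for i in range(len(A)) for j in range(len(A[0])))

def security_level(p, A):
    """Minimum of pAe^j over the columns j: what p guarantees to player 1."""
    return min(
        sum(p[i] * A[i][j] for i in range(len(A)))
        for j in range(len(A[0]))
    )

A = [[3, -1],
     [0, 2]]                                  # hypothetical matrix game
p = (Fraction(1, 2), Fraction(1, 2))
level = security_level(p, A)

# Spot check: no sampled mixed strategy of player 2 pushes the payoff below the level.
random.seed(0)
samples = [Fraction(random.randint(0, 100), 100) for _ in range(1000)]
never_below = all(payoff(p, A, (t, 1 - t)) >= level for t in samples)
print(level, never_below)
```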

Why would we be interested in such strategies? At first glance, these strategies
seem to express a very conservative or pessimistic, worst-case scenario attitude. The
reason for nevertheless considering maximin/minimax strategies is provided by the
so-called minimax theorem, which states that for every matrix game $A$ there is a real
number $v = v(A)$ with the following properties:

(a) A strategy $\mathbf{p}$ of player 1 guarantees a payoff of at least $v$ to player 1 (i.e., $\mathbf{p}A\mathbf{q} \ge v$
for all strategies $\mathbf{q}$ of player 2) if and only if $\mathbf{p}$ is a maximin strategy.

(b) A strategy $\mathbf{q}$ of player 2 guarantees a payment of at most $v$ by player 2 to player
1 (i.e., $\mathbf{p}A\mathbf{q} \le v$ for all strategies $\mathbf{p}$ of player 1) if and only if $\mathbf{q}$ is a minimax
strategy.

Hence, player 1 can obtain a payoff of at least $v$ by playing a maximin strategy,
and player 2 can guarantee to pay not more than $v$, and hence secure a payoff of at
least $-v$, by playing a minimax strategy. For these reasons, the number $v = v(A)$
is also called the value of the game $A$ (it represents the worth to player 1 of playing
the game $A$), and maximin and minimax strategies are called optimal strategies for
players 1 and 2, respectively.

Therefore, ‘solving’ the game $A$ means, naturally, determining the optimal
strategies and the value of the game. In the Battle of the Bismarck Sea in Sect. 1.3.1,
the pure strategies N of both players guarantee the same amount 2. Therefore, this is
the value of the game and N is optimal for both players. The analysis of that game is
easy since it has a ‘saddlepoint’, namely position (1, 1) with $a_{11} = 2$. The definition
of a saddlepoint is as follows.

Definition 2.4 (Saddlepoint) A position $(i, j)$ in a matrix game $A$ is a saddlepoint if

$$a_{ij} \ge a_{kj} \text{ for all } k = 1, \ldots, m \quad \text{and} \quad a_{ij} \le a_{ik} \text{ for all } k = 1, \ldots, n,$$

i.e., if $a_{ij}$ is maximal in its column $j$ and minimal in its row $i$. □
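
Definition 2.4 can be checked mechanically, entry by entry. In the sketch below, the matrix used for the Battle of the Bismarck Sea is an assumption: it is not reproduced in this section, and it is only chosen to be consistent with the statement that position (1, 1) with $a_{11} = 2$ is a saddlepoint. The Matching Pennies matrix is the standard one.

```python
def saddlepoints(A):
    """Return all positions (i, j), 1-based, where a_ij is maximal in column j
    and minimal in row i."""
    m, n = len(A), len(A[0])
    found = []
    for i in range(m):
        for j in range(n):
            col_max = max(A[k][j] for k in range(m))
            row_min = min(A[i][k] for k in range(n))
            if A[i][j] == col_max and A[i][j] == row_min:
                found.append((i + 1, j + 1))
    return found

# Assumed payoff matrix of the Battle of the Bismarck Sea (rows and columns N, S).
BISMARCK = [[2, 2],
            [1, 3]]
MATCHING_PENNIES = [[1, -1],
                    [-1, 1]]

print(saddlepoints(BISMARCK))
print(saddlepoints(MATCHING_PENNIES))
```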

Clearly, if (i, j) is a saddlepoint, then player 1 can guarantee a payoff of at least
$a_{ij}$ by playing the pure strategy row i, since $a_{ij}$ is minimal in row i. Similarly, player 2
can guarantee a payoff of at least $-a_{ij}$ by playing the pure strategy column j, since
$a_{ij}$ is maximal in column j. Hence, $a_{ij}$ must be the value of the game A: $v(A) = a_{ij}$, $e^i$
is an optimal (maximin) strategy of player 1, and $e^j$ is an optimal (minimax) strategy
of player 2.


2.2  Solving 2 × n Games and m × 2 Games

In  this  section  we  show  how  to  solve  matrix  games  where  at  least  one  of  the  players 
has  only  two  pure  strategies.  We  also  show  how  the  idea  of  strict  domination  can  be 
of  help  in  solving  matrix  games. 


2.2.1  2 × n Games

We demonstrate how to solve a matrix game with 2 rows and n columns graphically,
by considering the following 2 × 4 example:

\[
A = \begin{pmatrix} 10 & 2 & 4 & 1 \\ 2 & 10 & 8 & 12 \end{pmatrix}.
\]

The columns of A, i.e., the pure strategies of player 2, are labelled $e^1, e^2, e^3, e^4$ from
left to right for reference below. Let $\mathbf{p} = (p, 1-p)$ be an arbitrary strategy of player 1.
The expected payoffs to player 1 if player 2 plays a pure strategy are equal to:

\begin{align*}
\mathbf{p}Ae^1 &= 10p + 2(1-p) = 8p + 2 \\
\mathbf{p}Ae^2 &= 2p + 10(1-p) = 10 - 8p \\
\mathbf{p}Ae^3 &= 4p + 8(1-p) = 8 - 4p \\
\mathbf{p}Ae^4 &= p + 12(1-p) = 12 - 11p.
\end{align*}

We plot these four linear functions of p in one diagram:

[Figure: the four lines $\mathbf{p}Ae^1, \dots, \mathbf{p}Ae^4$ as functions of p on [0, 1], with their lower envelope drawn as a thick black curve.]

In this diagram the values of p are plotted on the horizontal axis, and the four straight
gray lines plot the payoffs to player 1 for each of the four pure strategies of player 2.
Observe that for every $0 \le p \le 1$ the minimum payoff that player 1 may obtain is
given by the lower envelope of these curves, the thick black curve in the diagram:
for any p, any combination $(q_1, q_2, q_3, q_4)$ of the points on $e^1$, $e^2$, $e^3$, and $e^4$ with first
coordinate p would lie on or above this lower envelope. Clearly, the lower envelope
is maximal for $p = p^* = \frac{1}{2}$, and the maximal value is 6. Hence, we have established
that player 1 has a unique optimal (maximin) strategy, namely $\mathbf{p}^* = (\frac{1}{2}, \frac{1}{2})$, and that
the value of the game, v(A), is equal to 6.
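The graphical argument can be checked numerically. The sketch below is an illustration under stated assumptions, not the book's procedure: it uses the fact that a minimum of linear functions is concave and piecewise linear, so its maximum over [0, 1] is attained at an endpoint or at an intersection of two lines. The function name `solve_2xn` is hypothetical, and exact rational arithmetic is used to avoid rounding.

```python
from fractions import Fraction
from itertools import combinations


def solve_2xn(A):
    """Maximize the lower envelope min_j pAe^j over p in [0, 1], where p is
    the probability on row 1 of the 2 x n matrix A.
    Returns one maximizer p and the envelope value there."""
    top, bottom = A
    n = len(top)
    # Line j: payoff(p) = slope_j * p + bottom[j].
    slopes = [Fraction(top[j] - bottom[j]) for j in range(n)]
    candidates = {Fraction(0), Fraction(1)}
    for j, k in combinations(range(n), 2):
        if slopes[j] != slopes[k]:
            p = Fraction(bottom[k] - bottom[j]) / (slopes[j] - slopes[k])
            if 0 <= p <= 1:
                candidates.add(p)

    def envelope(p):
        return min(slopes[j] * p + bottom[j] for j in range(n))

    best_p = max(candidates, key=envelope)
    return best_p, envelope(best_p)


A = [[10, 2, 4, 1],
     [2, 10, 8, 12]]
print(solve_2xn(A))  # p* = 1/2 with value 6
```

If the maximum is attained on a whole interval, the function returns only one of the maximizers; for the example above the maximizer is unique.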

What are the optimal or minimax strategies of player 2? From the theory of the
previous section we know that a minimax strategy $\mathbf{q} = (q_1, q_2, q_3, q_4)$ of player 2
should guarantee that player 2 has to pay at most the value of the game. From
the diagram it is clear that $q_4$ should be equal to zero, since otherwise the payoff to
player 1 would be larger than 6 if player 1 plays $(\frac{1}{2}, \frac{1}{2})$, and thus q would not be a
minimax strategy. So a minimax strategy has the form $\mathbf{q} = (q_1, q_2, q_3, 0)$. Any such
strategy, plotted in the diagram, gives a straight line that is a combination of the
lines associated with $e^1$, $e^2$, and $e^3$ and which passes through the point $(\frac{1}{2}, 6)$, since
all three lines pass through this point. Moreover, for no value of p should this straight
line exceed the value 6, otherwise q would not guarantee a payment of at most 6
by player 2. Consequently, this straight line has to be horizontal. Summarizing this
argument, we look for numbers $q_1, q_2, q_3 \ge 0$ such that

\begin{align*}
2q_1 + 10q_2 + 8q_3 &= 6 && \text{(left endpoint should be } (0, 6)\text{)} \\
10q_1 + 2q_2 + 4q_3 &= 6 && \text{(right endpoint should be } (1, 6)\text{)} \\
q_1 + q_2 + q_3 &= 1 && \text{(q is a probability vector).}
\end{align*}

By substitution, it is easy to reduce this system of equations to the two equations

\begin{align*}
3q_1 - q_2 &= 1 \\
q_1 + q_2 + q_3 &= 1.
\end{align*}

In fact, one of the two first equations could have been omitted from the beginning,
since we already know that any combination of the three lines passes through $(\frac{1}{2}, 6)$,
and two points are sufficient to determine a straight line.

From the remaining two equations, we obtain that the set of optimal strategies of
player 2 is

\[
\{\mathbf{q} = (q_1, q_2, q_3, q_4) \in \Delta^4 \mid q_2 = 3q_1 - 1,\ q_4 = 0\}.
\]

Note that, if $q_1 = \frac{1}{3}$, then $q_2 = 0$, and if $q_1 = \frac{1}{2}$, then $q_2 = \frac{1}{2}$. Clearly, $q_1$
cannot be smaller than $\frac{1}{3}$, since then $q_2$ would be negative, and it cannot be larger
than $\frac{1}{2}$, since then the sum $q_1 + q_2$ would exceed 1. Hence, the set of optimal
strategies of player 2 can alternatively be described as

\[
\{\mathbf{q} = (q_1, q_2, q_3, q_4) \in \mathbb{R}^4 \mid \tfrac{1}{3} \le q_1 \le \tfrac{1}{2},\ q_2 = 3q_1 - 1,\ q_3 = 1 - q_1 - q_2,\ q_4 = 0\}.
\]

This means that the set of optimal strategies of player 2 in this game is
one-dimensional, i.e., a line segment.
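As a quick check of this description, one can evaluate a few points of the segment and confirm that each is a probability vector and makes player 2 pay exactly 6 against both rows. The snippet is only a sketch; the helper names `payments` and `optimal_q` are introduced here for illustration.

```python
from fractions import Fraction

A = [[10, 2, 4, 1],
     [2, 10, 8, 12]]


def payments(q):
    """Expected payment to player 1 for each pure row against the mixed strategy q."""
    return [sum(a * qj for a, qj in zip(row, q)) for row in A]


def optimal_q(q1):
    """Point of the segment: q2 = 3*q1 - 1, q3 = 1 - q1 - q2, q4 = 0."""
    q2 = 3 * q1 - 1
    return (q1, q2, 1 - q1 - q2, Fraction(0))


# Both endpoints of the segment and one interior point.
for q1 in (Fraction(1, 3), Fraction(2, 5), Fraction(1, 2)):
    q = optimal_q(q1)
    print(q, payments(q))
```

Since the payment equals 6 against both pure rows, it equals 6 against every mixed strategy of player 1 as well.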


2.2.2  m × 2 Games

The solution method for m × 2 games is analogous. Consider the following
example:

\[
A = \begin{pmatrix} 10 & 2 \\ 2 & 10 \\ 4 & 8 \\ 1 & 12 \end{pmatrix},
\]

with rows labelled $e^1, e^2, e^3, e^4$ from top to bottom. Let $\mathbf{q} = (q, 1-q)$ be an arbitrary
strategy of player 2. Again, we make a diagram in which now the values of q are put on
the horizontal axis, and the straight lines indicated by $e^i$ for i = 1, 2, 3, 4 are the payoffs
to player 1 associated with his four pure strategies (rows) as functions of q. The
equations of these lines are given by:

\begin{align*}
e^1A\mathbf{q} &= 10q + 2(1-q) = 8q + 2 \\
e^2A\mathbf{q} &= 2q + 10(1-q) = 10 - 8q \\
e^3A\mathbf{q} &= 4q + 8(1-q) = 8 - 4q \\
e^4A\mathbf{q} &= q + 12(1-q) = 12 - 11q.
\end{align*}

The resulting diagram is as follows.

[Figure: the four lines $e^1A\mathbf{q}, \dots, e^4A\mathbf{q}$ as functions of q on [0, 1], with their upper envelope drawn as a thick black curve.]

Observe that the maximum payments that player 2 has to make are now located on
the upper envelope, represented by the thick black curve. The minimum is reached at
the point of intersection of $e^1$ and $e^4$ in the diagram, which has coordinates $(\frac{10}{19}, \frac{118}{19})$.
Hence, the value of the game is $\frac{118}{19}$ and the unique optimal (minimax) strategy of
player 2 is $\mathbf{q}^* = (\frac{10}{19}, \frac{9}{19})$.

To find the optimal strategy or strategies $\mathbf{p} = (p_1, p_2, p_3, p_4)$ of player 1, it
follows from the diagram that $p_2 = p_3 = 0$, since otherwise for $q = \frac{10}{19}$ the value $\frac{118}{19}$
of the game is not reached, so that p is not a maximin strategy. So we look for a
combination of $e^1$ and $e^4$ that gives at least $\frac{118}{19}$ for every q, hence it has to be equal
to $\frac{118}{19}$ for every q. This gives rise to the equations $2p_1 + 12p_4 = 10p_1 + p_4 = \frac{118}{19}$
and $p_1 + p_4 = 1$, with unique solution $p_1 = \frac{11}{19}$ and $p_4 = \frac{8}{19}$. So the unique optimal
strategy of player 1 is $(\frac{11}{19}, 0, 0, \frac{8}{19})$.
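The same candidate-point idea used for 2 × n games carries over, now minimizing an upper envelope. The code below is a sketch under the assumption that it suffices to inspect q = 0, q = 1 and the pairwise intersections of the row lines; `solve_mx2` is a name chosen here, not one used in the text.

```python
from fractions import Fraction
from itertools import combinations


def solve_mx2(A):
    """Minimize the upper envelope max_i e^iAq over q in [0, 1], where q is
    the probability on column 1 of the m x 2 matrix A.
    Returns one minimizer q and the envelope value there."""
    # Row i: payoff(q) = (a_i1 - a_i2) * q + a_i2.
    slopes = [Fraction(row[0] - row[1]) for row in A]
    intercepts = [Fraction(row[1]) for row in A]
    candidates = {Fraction(0), Fraction(1)}
    for i, k in combinations(range(len(A)), 2):
        if slopes[i] != slopes[k]:
            q = (intercepts[k] - intercepts[i]) / (slopes[i] - slopes[k])
            if 0 <= q <= 1:
                candidates.add(q)

    def envelope(q):
        return max(s * q + c for s, c in zip(slopes, intercepts))

    best_q = min(candidates, key=envelope)
    return best_q, envelope(best_q)


A = [[10, 2], [2, 10], [4, 8], [1, 12]]
q_star, value = solve_mx2(A)
print(q_star, value)  # 10/19 and 118/19

# Check of the maximin strategy: mixing rows 1 and 4 with weights 11/19 and 8/19
# yields the value against each pure column of player 2.
p = (Fraction(11, 19), 0, 0, Fraction(8, 19))
column_payoffs = [sum(p[i] * A[i][j] for i in range(4)) for j in range(2)]
print(column_payoffs)
```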


2.2.3  Strict  Domination 


The  idea  of  strict  domination  can  be  used  to  first  eliminate  pure  strategies  before  the 
graphical  analysis  of  a  matrix  game.  Consider  the  game 

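\[
\begin{pmatrix} 10 & 2 & 5 & 1 & 6 \\ 2 & 10 & 8 & 12 & 9 \end{pmatrix},
\]

with columns labelled $e^1, \dots, e^5$ from left to right. This game is obtained from the
game in Sect. 2.2.1 by adding a fifth column and changing $a_{13}$ from 4 to 5.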

In this game it cannot be optimal for player 2 to put positive probability on the
fifth column, since the payoffs in the third column are always (that is, no matter
what player 1's strategy is) better for player 2. So we can assume that the fifth
column is played with zero probability: it is strictly dominated by the third column.
This is a case where a pure strategy is strictly dominated by another pure strategy.

It  can  also  happen  that  a  pure  strategy  is  strictly  dominated  by  a  mixed  strategy. 
For  instance,  consider  the  third  column.  The  payoff  5  in  the  first  row  is  in  between 
the  payoffs  10  and  2  in  the  first  row  and  first  and  second  columns,  respectively;  and 
also  the  payoff  8  in  the  second  row  is  in  between  the  payoffs  2  and  10  in  the  second 
row  and  first  and  second  columns.  So  it  may  be  possible  to  find  a  combination  of 
the  first  two  columns  that  results  in  smaller  payoffs  than  those  in  the  third  column. 
In order to see if such a combination is possible, suppose that probability $\alpha$ is put
on the first column and $1 - \alpha$ is put on the second column. The resulting payoffs are

\[
\alpha \begin{pmatrix} 10 \\ 2 \end{pmatrix} + (1 - \alpha) \begin{pmatrix} 2 \\ 10 \end{pmatrix} = \begin{pmatrix} 8\alpha + 2 \\ 10 - 8\alpha \end{pmatrix}.
\]

We wish to have $8\alpha + 2 < 5$ and $10 - 8\alpha < 8$, and both inequalities hold as
long as $\frac{1}{4} < \alpha < \frac{3}{8}$. This means that the pure strategy $e^3$ is strictly dominated by
any (mixed) strategy $(\alpha, 1 - \alpha, 0, 0, 0)$, as long as $\alpha$ is in the computed range. This
implies that, in an optimal strategy, the probability $q_3$ put by player 2 on the third
column must be zero, otherwise player 2 could guarantee to pay less by adding $\alpha q_3$
to the first column and $(1 - \alpha) q_3$ to the second column, for any $\frac{1}{4} < \alpha < \frac{3}{8}$, and
playing the third column with zero probability.

The preceding analysis implies that, in order to solve the above game, we can
start by eliminating the third and fifth columns of the matrix. Thus, in the diagram
in Sect. 2.2.1, we do not have to draw the line corresponding to $e^3$. The value of the
game is still 6, player 1 still has a unique optimal strategy $\mathbf{p}^* = (\frac{1}{2}, \frac{1}{2})$, and player 2
now also has a unique optimal strategy, namely the one where $q_3 = 0$, which is the
strategy $(\frac{1}{2}, \frac{1}{2}, 0, 0, 0)$.
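The range of weights for which a mixture of two columns strictly dominates a third column can be computed row by row, since each row contributes one linear inequality in the weight. The following sketch assumes integer payoffs and reports bounds such that every weight strictly between them works; the function name `domination_range` is hypothetical.

```python
from fractions import Fraction


def domination_range(col_a, col_b, col_c):
    """Return (lo, hi) with lo < hi such that for every alpha strictly between
    lo and hi the mixture alpha*col_a + (1-alpha)*col_b is strictly smaller
    than col_c in every row; return None if no such interval exists.
    Columns are lists of payments made by player 2."""
    lo, hi = Fraction(0), Fraction(1)
    for a, b, c in zip(col_a, col_b, col_c):
        d, r = a - b, c - b          # requirement: alpha * d < r
        if d > 0:
            hi = min(hi, Fraction(r, d))
        elif d < 0:
            lo = max(lo, Fraction(r, d))
        elif r <= 0:                 # d == 0: the row requires 0 < r
            return None
    return (lo, hi) if lo < hi else None


# Columns 1, 2 and 3 of the 2 x 5 game in the text.
print(domination_range([10, 2], [2, 10], [5, 8]))  # bounds 1/4 and 3/8
# With the original entry a_13 = 4 no mixture of columns 1 and 2 works.
print(domination_range([10, 2], [2, 10], [4, 8]))  # None
```

The second call reflects the game of Sect. 2.2.1, where all three lines pass through the same point and the third column is therefore not strictly dominated.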

In  general,  strictly  dominated  pure  strategies  in  a  matrix  game  are  not  played 
with  positive  probability  in  any  optimal  strategy  and  can  therefore  be  eliminated 
before  solving  the  game.  Sometimes,  this  idea  can  also  be  used  to  solve  matrix 
games  in  which  each  player  has  more  than  two  pure  strategies  (m,  n  >  2).  Moreover, 
the  idea  can  be  applied  iteratively,  that  is,  after  elimination  of  a  strictly  dominated 
pure  strategy,  in  the  smaller  game  perhaps  another  strictly  dominated  pure  strategy 
can  be  eliminated,  etc.,  until  no  more  pure  strategies  are  strictly  dominated.  See 
Example  2.7  for  an  illustration,  and  see  Chap.  13  for  a  rigorous  treatment. 

We  first  give  the  formal  definition  of  strict  domination,  and  then  discuss  the 
announced  example. 

Definition 2.5 (Strict Domination) Let A be an m × n matrix game and i a row. The
pure strategy $e^i$ is strictly dominated if there is a strategy $\mathbf{p} = (p_1, \dots, p_m) \in \Delta^m$
such that $\mathbf{p}Ae^j > e^iAe^j$ for every $j = 1, \dots, n$. Similarly, let j be a column. The pure
strategy $e^j$ is strictly dominated if there is a strategy $\mathbf{q} = (q_1, \dots, q_n) \in \Delta^n$ such
that $e^iA\mathbf{q} < e^iAe^j$ for every $i = 1, \dots, m$. □

Remark 2.6 It is not difficult to see that if $e^i$ is strictly dominated, then it is strictly
dominated by some $\mathbf{p} \in \Delta^m$ with $p_i = 0$. Similarly, if $e^j$ is strictly dominated, then
it is strictly dominated by some $\mathbf{q} \in \Delta^n$ with $q_j = 0$. □

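Example 2.7 Consider the following 3 × 3 matrix game:

\[
A = \begin{pmatrix} 6 & 0 & 2 \\ 0 & 5 & 4 \\ 3 & 2 & 1 \end{pmatrix}.
\]

For player 1, the third strategy $e^3$ is strictly dominated by the strategy $\mathbf{p} =
(\frac{7}{12}, \frac{5}{12}, 0)$, since

\[
\mathbf{p}A = \begin{pmatrix} \tfrac{7}{2} & \tfrac{25}{12} & \tfrac{17}{6} \end{pmatrix} \quad\text{and}\quad e^3A = \begin{pmatrix} 3 & 2 & 1 \end{pmatrix}.
\]

Hence, in any optimal strategy player 1 puts zero probability on the third row.
Elimination of this row results in the matrix

\[
B = \begin{pmatrix} 6 & 0 & 2 \\ 0 & 5 & 4 \end{pmatrix}.
\]

Now, player 2's third strategy $e^3$ is strictly dominated by the strategy $\mathbf{q} = (\frac{1}{4}, \frac{3}{4}, 0)$,
since

\[
B\mathbf{q} = \begin{pmatrix} \tfrac{3}{2} \\ \tfrac{15}{4} \end{pmatrix} \quad\text{and}\quad Be^3 = \begin{pmatrix} 2 \\ 4 \end{pmatrix}.
\]

Hence, in any optimal strategy player 2 puts zero probability on the third column.
Elimination of this column results in the matrix

\[
C = \begin{pmatrix} 6 & 0 \\ 0 & 5 \end{pmatrix}.
\]

This is a 2 × 2 matrix game, which can be solved by the method in Sect. 2.2.1 or
Sect. 2.2.2. See Problem 2.1(a). □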
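The two elimination steps of Example 2.7 can be verified mechanically against Definition 2.5. The sketch below is illustrative only: the function names are chosen here, the 3 × 3 matrix is the one consistent with $e^3A = (3\ 2\ 1)$ and the reduced matrix of the example, and the dominating strategies (7/12, 5/12, 0) and (1/4, 3/4, 0) are taken as one admissible choice each.

```python
from fractions import Fraction as F


def row_dominated(A, i, p):
    """True if the mixed strategy p of player 1 strictly dominates pure row i,
    i.e. pAe^j > e^iAe^j for every column j."""
    m, n = len(A), len(A[0])
    return all(sum(p[k] * A[k][j] for k in range(m)) > A[i][j] for j in range(n))


def col_dominated(A, j, q):
    """True if the mixed strategy q of player 2 strictly dominates pure column j,
    i.e. e^iAq < e^iAe^j for every row i."""
    return all(sum(q[k] * row[k] for k in range(len(row))) < row[j] for row in A)


A = [[6, 0, 2],
     [0, 5, 4],
     [3, 2, 1]]

# Step 1: row 3 is strictly dominated for player 1.
step1 = row_dominated(A, 2, (F(7, 12), F(5, 12), 0))

# Step 2: after deleting row 3, column 3 is strictly dominated for player 2.
reduced = A[:2]
step2 = col_dominated(reduced, 2, (F(1, 4), F(3, 4), 0))

print(step1, step2)  # True True
```

Note that a strategy never strictly dominates itself, so the checks return False when the candidate is the pure strategy under test.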


2.3  Problems 


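2.1.  Solving Matrix Games

Solve the following matrix games, i.e., determine the optimal strategies and the
value of the game. Each time, start by checking if the game has a saddlepoint.

(a)
\[
\begin{pmatrix} 6 & 0 \\ 0 & 5 \end{pmatrix}
\]

What are the optimal strategies in the original matrix game A in Example 2.7?

(b)
\[
\begin{pmatrix} 2 & -1 & 0 & 2 \\ 2 & 0 & 0 & 3 \\ 0 & 0 & -1 & 2 \end{pmatrix}
\]

(c)
\[
\begin{pmatrix} 1 & 3 & 1 \\ 2 & 2 & 0 \\ 0 & 3 & 2 \end{pmatrix}
\]

(d)
\[
\begin{pmatrix} 16 & 12 & 2 \\ 2 & 6 & 16 \\ 8 & 8 & 6 \\ 0 & 7 & 8 \end{pmatrix}
\]

(e)
\[
\begin{pmatrix} 3 & 1 & 4 & 0 \\ 1 & 2 & 0 & 5 \end{pmatrix}
\]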

2.2.  Saddlepoints 

(a) Let A be an arbitrary m × n matrix game. Show that any two saddlepoints must
have the same value. In other words, if (i, j) and (k, l) are two saddlepoints, show
that $a_{ij} = a_{kl}$.

(b)  Let  A  be  a  4  x  4  matrix  game  in  which  (1,1)  and  (4, 4)  are  saddlepoints.  Show 
that  A  has  at  least  two  other  saddlepoints. 

(c)  Give  an  example  of  a  4  x  4  matrix  game  with  exactly  three  saddlepoints. 

2.3.  Maximin  Rows  and  Minimax  Columns 

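Row i is a maximin row in an m × n matrix game A if
$\min_{j \in \{1,\dots,n\}} a_{ij} \ge \min_{j \in \{1,\dots,n\}} a_{kj}$ for all $k \in \{1, \dots, m\}$.
Column j is a minimax column if
$\max_{i \in \{1,\dots,m\}} a_{ij} \le \max_{i \in \{1,\dots,m\}} a_{i\ell}$ for all $\ell \in \{1, \dots, n\}$.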

Consider  the  following  matrix  game: 

2  0\ 

2  2 

2  4  ' 

3  \) 


(a)  Determine  all  maximin  rows  and  minimax  columns.  What  can  you  conclude 
from  this  about  the  value  of  this  game? 

i 

(b)  The  value  of  this  game  is  y .  Use  this  to  give  an  argument  why  player  2  will  put 
zero  probability  on  column  2  in  any  minimax  strategy. 

(c)  Determine  all  minimax  strategies  of  player  2  and  all  maximin  strategies  of 
player  1 . 

2.4.  Subgames  of  Matrix  Games 

Consider  the  following  matrix  game: 


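\[
A = \begin{pmatrix} 3 & 1 & 4 & 0 \\ 1 & 2 & 0 & 5 \end{pmatrix}.
\]

(a) Determine all maximin rows and minimax columns. (See Problem 2.3.) What
can you conclude from this about the value of this game?

(b) Consider the six different 2 × 2 matrix games that can be obtained by choosing
two columns from A, as follows:

\[
A_1 = \begin{pmatrix} 3 & 1 \\ 1 & 2 \end{pmatrix}, \quad
A_2 = \begin{pmatrix} 3 & 4 \\ 1 & 0 \end{pmatrix}, \quad
A_3 = \begin{pmatrix} 3 & 0 \\ 1 & 5 \end{pmatrix},
\]
\[
A_4 = \begin{pmatrix} 1 & 4 \\ 2 & 0 \end{pmatrix}, \quad
A_5 = \begin{pmatrix} 1 & 0 \\ 2 & 5 \end{pmatrix}, \quad
A_6 = \begin{pmatrix} 4 & 0 \\ 0 & 5 \end{pmatrix}.
\]

Determine the values of all these games. Which one must be equal to the value
of A?

(c) Determine all maximin and minimax strategies of A. [Hint: Use your answer to
(b).]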

2.5.  Rock-Paper-Scissors 

In the famous Rock-Paper-Scissors two-player game each player has three pure
strategies:  Rock,  Paper,  and  Scissors.  Here,  Scissors  beats  Paper,  Paper  beats  Rock, 
Rock  beats  Scissors.  Assign  a  1  to  winning,  0  to  a  draw,  and  —1  to  losing.  Model 
this  game  as  a  matrix  game,  try  to  guess  its  optimal  strategies,  and  then  show  that 
these  are  the  unique  optimal  strategies.  What  is  the  value  of  this  game? 


2.4  Notes 

The  theory  of  zero-sum  games  was  developed  by  von  Neumann  (1928),  who 
proved  the  minimax  theorem.  In  general,  matrix  games  can  be  solved  by  Linear 
Programming.  See  Chap.  12  for  details. 


Reference 

von  Neumann,  J.  (1928).  Zur  Theorie  der  Gesellschaftsspiele.  Mathematische  Annalen,  100, 
295-320.


In this chapter we consider two-player games where each player chooses from
finitely  many  pure  strategies  or  randomizes  among  these  strategies.  In  contrast  to 
Chap.  2  it  is  no  longer  required  that  the  sum  of  the  players’  payoffs  is  zero  (or, 
equivalently,  constant).  This  allows  for  a  much  larger  class  of  games,  including 
many  games  relevant  for  economic  or  other  applications.  Famous  examples  are  the 
Prisoners’  Dilemma  and  the  Battle  of  the  Sexes  discussed  in  Sect.  1.3.2. 

In  Sect.  3.1  we  introduce  the  model  and  the  concept  of  Nash  equilibrium. 
Section  3.2  shows  how  to  compute  Nash  equilibria  in  pure  strategies  for  arbitrary 
games,  all  Nash  equilibria  in  games  where  both  players  have  exactly  two  pure 
strategies,  and  how  to  use  the  concept  of  strict  domination  to  facilitate  computation 
of  Nash  equilibria  and  to  compute  equilibria  also  of  larger  games.  The  structure 
of  this  chapter  thus  parallels  the  structure  of  Chap.  2.  For  a  deeper  and  more 
comprehensive  analysis  of  finite  two-person  games  see  Chap.  13. 


3.1  Basic  Definitions  and  Theory 

The  data  of  a  finite  two-person  game  can  be  summarized  by  two  matrices.  Usually, 
these  matrices  are  written  as  one  matrix  with  two  numbers  at  each  position. 
Therefore,  such  games  are  often  called  ‘bimatrix  games’.  The  definition  is  as 
follows. 

Definition  3.1  (Bimatrix  Game)  A  bimatrix  game  is  a  pair  of  m  x  n  matrices 
( A,B ),  where  m  is  the  number  of  rows  and  n  the  number  of  columns.  □ 

The interpretation of a bimatrix game (A, B) is that, if player 1 (the row player) plays
row i and player 2 (the column player) plays column j, then player 1 receives payoff
$a_{ij}$ and player 2 receives $b_{ij}$, where these numbers are the corresponding entries of A
and B, respectively. Definitions and notations for pure and mixed strategies, strategy
sets and expected payoffs are similar to those for matrix games, see Sect. 2.1, but for
easy reference we repeat them here. A (mixed) strategy of player 1 is a probability
distribution p over the rows of A and B, i.e., an element of the set

\[
\Delta^m := \Big\{ \mathbf{p} = (p_1, \dots, p_m) \in \mathbb{R}^m \ \Big|\ \sum_{i=1}^{m} p_i = 1,\ p_i \ge 0 \text{ for all } i = 1, \dots, m \Big\}.
\]

Similarly, a (mixed) strategy of player 2 is a probability distribution q over the
columns of A and B, i.e., an element of the set

\[
\Delta^n := \Big\{ \mathbf{q} = (q_1, \dots, q_n) \in \mathbb{R}^n \ \Big|\ \sum_{j=1}^{n} q_j = 1,\ q_j \ge 0 \text{ for all } j = 1, \dots, n \Big\}.
\]

A strategy p of player 1 is called pure if there is a row i with $p_i = 1$. This strategy
is also denoted by $e^i$. Similarly, a strategy q of player 2 is called pure if there is a
column j with $q_j = 1$. This strategy is also denoted by $e^j$. If player 1 plays p and
player 2 plays q then the payoff to player 1 is the expected payoff

\[
\mathbf{p}A\mathbf{q} = \sum_{i=1}^{m} \sum_{j=1}^{n} p_i q_j a_{ij},
\]

and the payoff to player 2 is the expected payoff

\[
\mathbf{p}B\mathbf{q} = \sum_{i=1}^{m} \sum_{j=1}^{n} p_i q_j b_{ij}.
\]

As mentioned, the entries of A and B are usually grouped together in one
(bi)matrix, by putting the pair $a_{ij}, b_{ij}$ at position (i, j) of the matrix. Cf. the examples
in Sect. 1.3.2.
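The two double sums can be evaluated directly. The following sketch is only an illustration: the function name `expected_payoffs` is an assumption, and the numbers are borrowed from the 2 × 2 game of Sect. 3.2.2 together with one particular pair of mixed strategies.

```python
from fractions import Fraction as F


def expected_payoffs(A, B, p, q):
    """Return the pair (pAq, pBq), each computed as the double sum of
    p_i * q_j * a_ij (respectively b_ij) over all rows i and columns j."""
    m, n = len(p), len(q)
    pAq = sum(p[i] * q[j] * A[i][j] for i in range(m) for j in range(n))
    pBq = sum(p[i] * q[j] * B[i][j] for i in range(m) for j in range(n))
    return pAq, pBq


# Payoff matrices of the game with rows T, B and columns L, R from Sect. 3.2.2.
A = [[2, 0], [1, 3]]   # player 1
B = [[2, 1], [1, 3]]   # player 2
p = (F(2, 3), F(1, 3))
q = (F(3, 4), F(1, 4))
print(expected_payoffs(A, B, p, q))  # payoffs 3/2 and 5/3
```

For pure strategies the sums collapse to a single entry, for example `expected_payoffs(A, B, (1, 0), (1, 0))` gives the pair of entries at position (1, 1).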

Central  to  noncooperative  game  theory  is  the  idea  of  best  reply.  It  says  that 
a  rational  selfish  player  should  always  maximize  his  (expected)  payoff,  given  his 
knowledge  of  or  conjecture  about  the  strategies  chosen  by  the  other  players. 

Definition 3.2 (Best Reply) A strategy p of player 1 is a best reply to a strategy q
of player 2 in an m × n bimatrix game (A, B) if

\[
\mathbf{p}A\mathbf{q} \ge \mathbf{p}'A\mathbf{q} \quad \text{for all } \mathbf{p}' \in \Delta^m.
\]

Similarly, q is a best reply of player 2 to p if

\[
\mathbf{p}B\mathbf{q} \ge \mathbf{p}B\mathbf{q}' \quad \text{for all } \mathbf{q}' \in \Delta^n.
\]
□

In  a  Nash  equilibrium,  each  player’s  strategy  is  a  best  reply  to  the  other  player’s 
strategy. 

Definition  3.3  (Nash  Equilibrium)  A  pair  of  strategies  (p*,q*)  in  a  bimatrix 
game  (A,  B)  is  a  Nash  equilibrium  if  p*  is  a  best  reply  of  player  1  to  q*  and  q* 
is  a  best  reply  of  player  2  to  p*.  A  Nash  equilibrium  (p*,  q*)  is  called  pure  if  both 
p*  and  q*  are  pure  strategies.  □ 

The  concept  of  a  Nash  equilibrium  can  be  extended  to  arbitrary  games,  including 
games  with  arbitrary  numbers  of  players,  strategy  sets,  and  payoff  functions.  We 
will  see  many  examples  in  later  chapters. 

Every bimatrix game has a Nash equilibrium: for a proof see Sect. 13.1. Generally
speaking, the main concern with Nash equilibrium is not its existence but rather
the  opposite,  namely  its  abundance,  as  well  as  its  interpretation.  In  many  games, 
there  are  many  Nash  equilibria,  and  then  the  questions  of  equilibrium  selection  and 
equilibrium  refinement  are  relevant  (cf.  Chap.  13).  With  respect  to  interpretation,  an 
old  question  is  how  the  players  would  come  to  play  a  Nash  equilibrium  in  reality. 
The  definition  of  Nash  equilibrium  does  not  say  anything  about  this. 

For  a  Nash  equilibrium  in  mixed  strategies  as  in  Definition  3.3,  an  additional 
question  is  what  the  meaning  of  such  a  mixed  strategy  is.  Does  it  mean  that 
the  players  actually  randomize  when  playing  the  game?  A  different  and  common 
interpretation  is  that  a  mixed  strategy  of  a  player,  say  player  1 ,  represents  the  belief, 
or  conjecture,  of  the  other  player,  player  2,  about  what  player  1  will  do.  Thus,  it 
embodies  the  ‘strategic  uncertainty’  of  the  players  in  a  game. 

For  now,  we  just  leave  these  questions  aside  and  take  the  definition  of  Nash 
equilibrium  at  face  value.  We  show  how  to  compute  pure  Nash  equilibria  in  general, 
and  all  Nash  equilibria  in  games  where  both  players  have  two  pure  strategies.  Just 
as  in  Chap.  2,  we  also  consider  the  role  of  strict  domination. 


3.2  Finding  Nash  Equilibria 

To  find  all  Nash  equilibria  of  an  arbitrary  bimatrix  game  is  a  difficult  task.  We  refer 
to  Sect.  13.2.3  for  more  discussion  on  this  problem.  Here  we  restrict  ourselves  to, 
first,  the  much  easier  problem  of  finding  all  Nash  equilibria  in  pure  strategies  of 
an  arbitrary  bimatrix  game  and,  second,  to  showing  how  to  find  all  Nash  equilibria 
in  2  x  2  games  graphically.  It  is  also  possible  to  solve  2x3  and  3x2  games 
graphically,  see  Sect.  13.2.2.  For  larger  games,  graphical  solutions  are  impractical 
or, indeed, impossible.


3.2.1  Pure Nash Equilibria

To  find  the  pure  Nash  equilibria  in  a  bimatrix  game,  one  can  first  determine  the  pure 
best  replies  of  player  2  to  every  pure  strategy  of  player  1 ,  and  next  determine  the 
pure  best  replies  of  player  1  to  every  pure  strategy  of  player  2.  Those  pairs  of  pure 
strategies  that  are  mutual  best  replies  are  the  pure  Nash  equilibria  of  the  game.  To 
illustrate  this  method,  consider  the  bimatrix  game 


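\[
\begin{array}{c|cccc}
  & W & X & Y & Z \\ \hline
T & 2,2 & 4,0 & 1,1 & 3,2 \\
M & 0,3 & 1,5 & 4,4 & 3,4 \\
B & 2,0 & 2,1 & 5,1 & 1,0
\end{array}
\]

First we determine the pure best replies of player 2 to every pure strategy of player
1, indicated by underlining the corresponding entries. This yields:

\[
\begin{array}{c|cccc}
  & W & X & Y & Z \\ \hline
T & 2,\underline{2} & 4,0 & 1,1 & 3,\underline{2} \\
M & 0,3 & 1,\underline{5} & 4,4 & 3,4 \\
B & 2,0 & 2,\underline{1} & 5,\underline{1} & 1,0
\end{array}
\]

Next, we determine the pure best replies of player 1 to every pure strategy of player
2, again indicated by underlining the corresponding entries. This yields:

\[
\begin{array}{c|cccc}
  & W & X & Y & Z \\ \hline
T & \underline{2},2 & \underline{4},0 & 1,1 & \underline{3},2 \\
M & 0,3 & 1,5 & 4,4 & \underline{3},4 \\
B & \underline{2},0 & 2,1 & \underline{5},1 & 1,0
\end{array}
\]

Putting the two results together yields:

\[
\begin{array}{c|cccc}
  & W & X & Y & Z \\ \hline
T & \underline{2},\underline{2} & \underline{4},0 & 1,1 & \underline{3},\underline{2} \\
M & 0,3 & 1,\underline{5} & 4,4 & \underline{3},4 \\
B & \underline{2},0 & 2,\underline{1} & \underline{5},\underline{1} & 1,0
\end{array}
\]

We conclude that the game has three Nash equilibria in pure strategies, namely
(T, W), (T, Z), and (B, Y). In mixed strategy notation, these are the pairs
$(e^1, e^1)$, $(e^1, e^4)$, and $(e^3, e^3)$. In more extensive notation: ((1, 0, 0), (1, 0, 0, 0)),
((1, 0, 0), (0, 0, 0, 1)), and ((0, 0, 1), (0, 0, 1, 0)).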
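The underlining procedure amounts to checking, for every cell, whether player 1's payoff is maximal in its column of A and player 2's payoff is maximal in its row of B. A minimal sketch follows; the function name `pure_nash` and the 0-based indices are choices made for this illustration.

```python
def pure_nash(A, B):
    """Return all 0-based pairs (i, j) that are mutual pure best replies:
    A[i][j] is maximal in column j of A and B[i][j] is maximal in row i of B."""
    m, n = len(A), len(A[0])
    equilibria = []
    for i in range(m):
        for j in range(n):
            best_for_1 = A[i][j] == max(A[k][j] for k in range(m))
            best_for_2 = B[i][j] == max(B[i])
            if best_for_1 and best_for_2:
                equilibria.append((i, j))
    return equilibria


# Rows T, M, B and columns W, X, Y, Z of the example above.
A = [[2, 4, 1, 3],
     [0, 1, 4, 3],
     [2, 2, 5, 1]]   # payoffs of player 1
B = [[2, 0, 1, 2],
     [3, 5, 4, 4],
     [0, 1, 1, 0]]   # payoffs of player 2

print(pure_nash(A, B))  # [(0, 0), (0, 3), (2, 2)], i.e. (T, W), (T, Z), (B, Y)
```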

Strictly  speaking,  one  should  also  consider  mixed  best  replies  to  a  pure  strategy 
in  order  to  establish  whether  this  pure  strategy  can  occur  in  a  Nash  equilibrium, 
but  it  is  not  difficult  to  see  that  any  mixed  best  reply  is  a  combination  of  pure  best 
replies and, thus, can never lead to a higher payoff. For instance, in the example
above, any strategy of the form (q, 0, 0, 1 − q) played against T yields to player 2
a payoff of 2 (= 2q + 2(1 − q)) and is therefore a best reply, but does not yield a
payoff higher than W or Z. However, the reader can check that all strategy pairs of
the form (T, (q, 0, 0, 1 − q)) (0 < q < 1) are also Nash equilibria of this game.
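The check left to the reader can also be carried out numerically. The sketch below relies on the fact, noted above, that a mixed strategy never yields more than the best pure reply, so it suffices to compare the payoff of each player with the payoffs of all pure deviations. The name `is_nash` is hypothetical.

```python
from fractions import Fraction as F


def is_nash(A, B, p, q):
    """True if p is a best reply to q in A and q is a best reply to p in B.
    Only pure deviations are compared, which is sufficient."""
    m, n = len(A), len(A[0])
    row_payoffs = [sum(A[i][j] * q[j] for j in range(n)) for i in range(m)]
    col_payoffs = [sum(p[i] * B[i][j] for i in range(m)) for j in range(n)]
    pAq = sum(p[i] * row_payoffs[i] for i in range(m))
    pBq = sum(q[j] * col_payoffs[j] for j in range(n))
    return pAq >= max(row_payoffs) and pBq >= max(col_payoffs)


A = [[2, 4, 1, 3], [0, 1, 4, 3], [2, 2, 5, 1]]   # player 1
B = [[2, 0, 1, 2], [3, 5, 4, 4], [0, 1, 1, 0]]   # player 2
T = (1, 0, 0)

# A few sample values of q; each pair (T, (q, 0, 0, 1 - q)) passes the check.
for q in (F(1, 4), F(1, 2), F(3, 4)):
    print(q, is_nash(A, B, T, (q, 0, 0, 1 - q)))
```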

It  is  also  clear  from  this  example  that  a  Nash  equilibrium  does  not  have  to  result 
in  Pareto  optimal  payoffs:  a  pair  of  payoffs  is  Pareto  optimal  if  there  is  no  other 
pair  of  payoffs  which  are  at  least  as  high  for  both  players  and  strictly  higher  for 
at  least  one  player.  The  payoff  pair  (4,4),  resulting  from  (M,  Y),  is  better  for  both 
players  than  the  equilibrium  payoffs  (2,2),  resulting  from  ( T ,  W).  We  know  this 
phenomenon  already  from  the  Prisoners’  Dilemma  game  in  Sect.  1.3.2. 


3.2.2  2x2  Games 

We  demonstrate  the  graphical  solution  method  for  2  x  2  games  by  means  of  an 
example.  Consider  the  bimatrix  game 


\[
\begin{array}{c|cc}
  & L & R \\ \hline
T & 2,2 & 0,1 \\
B & 1,1 & 3,3
\end{array}
\]

Observe  that  this  game  has  two  Nash  equilibria  in  pure  strategies,  namely  (T,  L)  and 
(B,  R).  To  find  all  Nash  equilibria  we  determine  the  best  replies  of  both  players. 

First consider the strategy (q, 1 − q) of player 2. The unique best reply of player
1 to this strategy is T or, equivalently, (1, 0), if the expected payoff from playing T
is higher than the expected payoff from playing B, since then it is also higher than
the expected payoff from playing any combination (p, 1 − p) of T and B. Hence, the
best reply is T if

\[
2q + 0(1 - q) > 1q + 3(1 - q),
\]

so if $q > \frac{3}{4}$. Similarly, we find that B is the unique best reply if $q < \frac{3}{4}$, and that T
and B are both best replies if $q = \frac{3}{4}$. In the last case, since T and B yield the same
payoff to player 1 against (q, 1 − q), it follows that any (p, 1 − p) is a best reply.
Summarizing, if we denote the set of best replies of player 1 against (q, 1 − q) by
$\beta_1(q, 1 - q)$, we have

\[
\beta_1(q, 1 - q) =
\begin{cases}
\{(1, 0)\} & \text{if } \frac{3}{4} < q \le 1 \\
\{(p, 1 - p) \mid 0 \le p \le 1\} & \text{if } q = \frac{3}{4} \\
\{(0, 1)\} & \text{if } 0 \le q < \frac{3}{4}.
\end{cases}
\tag{3.1}
\]

By analogous arguments, we find that for a strategy (p, 1 − p) the best replies
$\beta_2(p, 1 - p)$ of player 2 are given by

\[
\beta_2(p, 1 - p) =
\begin{cases}
\{(1, 0)\} & \text{if } \frac{2}{3} < p \le 1 \\
\{(q, 1 - q) \mid 0 \le q \le 1\} & \text{if } p = \frac{2}{3} \\
\{(0, 1)\} & \text{if } 0 \le p < \frac{2}{3}.
\end{cases}
\tag{3.2}
\]

By definition, the Nash equilibria of the game are the strategy combinations $(\mathbf{p}^*, \mathbf{q}^*)$
such that $\mathbf{p}^* \in \beta_1(\mathbf{q}^*)$ and $\mathbf{q}^* \in \beta_2(\mathbf{p}^*)$, i.e., the points of intersection of the
best reply functions in (3.1) and (3.2). A convenient way to find these points is by
drawing the graphs of $\beta_1(q, 1 - q)$ and $\beta_2(p, 1 - p)$. We put p on the horizontal axis
and q on the vertical axis and obtain the following diagram.


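[Figure: best reply curves of the two players, with p on the horizontal axis (tick at 2/3) and q on the vertical axis (tick at 3/4).]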


The  solid  black  curve  depicts  the  best  replies  of  player  1  and  the  solid  grey  curve 
depicts  the  best  replies  of  player  2.  The  solid  circles  indicate  the  three  Nash 
equilibria  of  the  game:  ((1,0),  (1,0)),  ((2/3, 1/3),  (3/4, 1/4)),  and  ((0, 1),  (0, 1)). 
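The mixed equilibrium in the interior of the diagram is the point where each player is indifferent between his two pure strategies. The sketch below solves the two indifference equations in closed form; it is an illustration under the assumption that an interior solution exists, and the function name `interior_equilibrium` is introduced here.

```python
from fractions import Fraction as F


def interior_equilibrium(A, B):
    """Completely mixed Nash equilibrium of a 2 x 2 bimatrix game with integer
    payoffs, or None if the indifference equations have no solution strictly
    inside (0, 1) for both players."""
    # Player 1 indifferent between rows: a11*q + a12*(1-q) = a21*q + a22*(1-q).
    denom_q = A[0][0] - A[0][1] - A[1][0] + A[1][1]
    # Player 2 indifferent between columns: b11*p + b21*(1-p) = b12*p + b22*(1-p).
    denom_p = B[0][0] - B[1][0] - B[0][1] + B[1][1]
    if denom_q == 0 or denom_p == 0:
        return None
    q = F(A[1][1] - A[0][1], denom_q)
    p = F(B[1][1] - B[1][0], denom_p)
    if 0 < p < 1 and 0 < q < 1:
        return (p, 1 - p), (q, 1 - q)
    return None


A = [[2, 0], [1, 3]]   # player 1, rows T, B and columns L, R
B = [[2, 1], [1, 3]]   # player 2
print(interior_equilibrium(A, B))  # ((2/3, 1/3), (3/4, 1/4))
```

The two pure equilibria are not found by this function; they correspond to the corners of the diagram and can be obtained with the method of Sect. 3.2.1.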


3.2.3  Strict  Domination 

The  graphical  method  discussed  in  Sect.  3.2.2  is  suited  for  2  x  2  games.  It  can  be 
extended  to  2  x  3  and  3x2  games  as  well,  see  Sect.  13.2.2. 

In  general,  for  the  purpose  of  finding  Nash  equilibria  the  size  of  a  game  can 
sometimes  be  reduced  by  iteratively  eliminating  strictly  dominated  strategies.  We 
look  for  a  strictly  dominated  (pure)  strategy  of  a  player,  eliminate  the  associated  row 
or  column,  and  continue  this  procedure  for  the  smaller  game  until  there  is  no  more 
strictly  dominated  strategy.  It  can  be  shown  (see  Sect.  13.3)  that  no  pure  strategy 
that  is  eliminated  by  this  procedure  is  ever  played  with  positive  probability  in  a 
Nash  equilibrium  of  the  original  game.  Thus,  no  Nash  equilibrium  of  the  original 
game  is  eliminated.  Also,  no  Nash  equilibrium  is  added.  It  follows,  in  particular, 
that  the  order  in  which  strictly  dominated  strategies  are  eliminated  does  not  matter. 

For  completeness  we  first  repeat  the  definition  of  strict  domination,  formulated 
for a bimatrix game, and then present an example.

Definition 3.4 (Strict Domination) Let (A, B) be an m × n bimatrix game and
i a row. The pure strategy $e^i$ is strictly dominated if there is a strategy $\mathbf{p} =
(p_1, \dots, p_m) \in \Delta^m$ such that $\mathbf{p}Ae^j > e^iAe^j$ for every $j = 1, \dots, n$. Similarly,
let j be a column. The pure strategy $e^j$ is strictly dominated if there is a strategy
$\mathbf{q} = (q_1, \dots, q_n) \in \Delta^n$ such that $e^iB\mathbf{q} > e^iBe^j$ for every $i = 1, \dots, m$. □


We observe that Remark 2.6 is still valid: if $e^i$ for player 1 is strictly dominated, then
it is strictly dominated by some p with $p_i = 0$; and similarly for player 2.

Consider  the  following  bimatrix  game 


\[
\begin{array}{c|cccc}
  & W & X & Y & Z \\ \hline
T & 2,2 & 2,1 & 2,2 & 0,0 \\
M & 1,0 & 4,1 & 2,4 & 1,5 \\
B & 0,4 & 3,1 & 3,0 & 3,3
\end{array}
\]

Observe,  first,  that  no  pure  strategy  (row)  of  player  1  is  strictly  dominated  by  another 
pure  strategy  of  player  1 ,  and  that  no  pure  strategy  (column)  of  player  2  is  strictly 
dominated  by  another  pure  strategy  of  player  2.  Consider  the  pure  strategy  X  of 
player  2.  Note  that  the  payoffs  in  column  X  for  player  2  are  below  the  maximum  of 
the  payoffs  in  columns  W  and  Y:  1  <  max{2,  2},  1  <  max{0,  4},  and  1  <  max{4, 0}. 
Therefore,  we  may  try  and  see  if  X  can  be  strictly  dominated  by  a  combination  of 
W and Y, i.e., by a strategy of the form (q, 0, 1 − q, 0). For this we need: $1 <
2q + 2(1 - q)$, $1 < 0q + 4(1 - q)$, and $1 < 4q + 0(1 - q)$. These three inequalities
hold for all q with $\frac{1}{4} < q < \frac{3}{4}$. For instance, X is strictly dominated by $(\frac{1}{2}, 0, \frac{1}{2}, 0)$.

So  X  can  be  eliminated,  to  obtain 


         W     Y     Z
   T   2,2   2,2   0,0
   M   1,0   2,4   1,5
   B   0,4   3,0   3,3


Next, in this reduced game, for player 1 pure strategy M is strictly dominated by any
strategy of the form (p, 0, 1 - p) with 1/2 < p < 2/3. So M can be eliminated to obtain

         W     Y     Z
   T   2,2   2,2   0,0
   B   0,4   3,0   3,3

Here, finally, Z can be eliminated since it is strictly dominated by W, and we are left
with the 2 × 2 game

         W     Y
   T   2,2   2,2
   B   0,4   3,0
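The three elimination steps can be replayed in a few lines of Python. This is only a sketch: the helper names `row_dominated` and `col_dominated` are introduced here, and the dominating mixtures (q = 1/2 for X, p = 3/5 for M) are particular choices from the admissible ranges, not the only ones.

```python
from fractions import Fraction as F

# Payoff pairs (player 1, player 2) of the 3 x 4 example.
game = {
    ("T", "W"): (2, 2), ("T", "X"): (2, 1), ("T", "Y"): (2, 2), ("T", "Z"): (0, 0),
    ("M", "W"): (1, 0), ("M", "X"): (4, 1), ("M", "Y"): (2, 4), ("M", "Z"): (1, 5),
    ("B", "W"): (0, 4), ("B", "X"): (3, 1), ("B", "Y"): (3, 0), ("B", "Z"): (3, 3),
}

def row_dominated(target, mix, cols):
    """True if the row mixture `mix` gives player 1 strictly more than
    row `target` against every column in `cols`."""
    return all(
        sum(w * game[r, c][0] for r, w in mix.items()) > game[target, c][0]
        for c in cols
    )

def col_dominated(target, mix, rows):
    """True if the column mixture `mix` gives player 2 strictly more than
    column `target` against every row in `rows`."""
    return all(
        sum(w * game[r, c][1] for c, w in mix.items()) > game[r, target][1]
        for r in rows
    )

rows, cols = ["T", "M", "B"], ["W", "X", "Y", "Z"]
mix_TB = {"T": F(3, 5), "B": F(2, 5)}

# While X is still present, this mixture of T and B does not dominate M.
before = row_dominated("M", mix_TB, cols)

step1 = col_dominated("X", {"W": F(1, 2), "Y": F(1, 2)}, rows)
cols.remove("X")
step2 = row_dominated("M", mix_TB, cols)
rows.remove("M")
step3 = col_dominated("Z", {"W": F(1)}, rows)
cols.remove("Z")

print(before, step1, step2, step3, rows, cols)
```

The variable `before` records that the chosen mixture of T and B does not dominate M as long as column X is present, which is why M is eliminated only in the reduced game.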
This game can be solved by the graphical method of Sect. 3.2.2. Doing so results in
the diagram

[Diagram: best replies of both players, with p on the horizontal axis and q on the
vertical axis; the value q = 1/3 is marked on the vertical axis.]

The solid black (grey) curve depicts player 1's (2's) best replies. In this case, the
curves overlap in infinitely many points, resulting in the set of Nash equilibria
{((1, 0), (q, 1 - q)) | 1/3 ≤ q ≤ 1}. In the original 3 × 4 game, the set of all
Nash equilibria is therefore equal to

{((1, 0, 0), (q, 0, 1 - q, 0)) | 1/3 ≤ q ≤ 1} .
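As a cross-check of this set, the following Python sketch tests whether a candidate pair in the original 3 × 4 game is a Nash equilibrium by comparing each player's expected payoff with the best payoff obtainable from a pure deviation. The function names `is_nash` and `candidate` are introduced here for illustration only.

```python
from fractions import Fraction as F

# Payoff matrices of the original 3 x 4 game (rows T, M, B; columns W, X, Y, Z).
A = [[2, 2, 2, 0], [1, 4, 2, 1], [0, 3, 3, 3]]  # player 1
B = [[2, 1, 2, 0], [0, 1, 4, 5], [4, 1, 0, 3]]  # player 2

def is_nash(p, q):
    """True if neither player can gain by deviating to a pure strategy."""
    m, n = len(A), len(A[0])
    row_pay = [sum(A[i][j] * q[j] for j in range(n)) for i in range(m)]
    col_pay = [sum(p[i] * B[i][j] for i in range(m)) for j in range(n)]
    u1 = sum(p[i] * row_pay[i] for i in range(m))
    u2 = sum(q[j] * col_pay[j] for j in range(n))
    return u1 >= max(row_pay) and u2 >= max(col_pay)

def candidate(q):
    """The pair ((1, 0, 0), (q, 0, 1 - q, 0))."""
    return (1, 0, 0), (q, 0, 1 - q, 0)

for q in (F(0), F(1, 4), F(1, 3), F(1, 2), F(1)):
    print(q, is_nash(*candidate(q)))
```

Checking pure deviations suffices, because a mixed deviation can never do better than the best pure strategy in its support.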




3.3  Problems 

3.1.  Some  Applications 

In each of the following situations, set up the corresponding bimatrix game and
solve for all Nash equilibria.

(a)  Pure  coordination.  Two  firms  (Smith  and  Brown)  decide  whether  to  design  the 
computers  they  sell  to  use  large  or  small  floppy  disks.  Both  players  will  sell  more 
computers if their disk drives are compatible. If they both choose large disks
the payoffs will be 2 for each. If they both choose small disks the payoffs
will be 1 for each. If they choose different sizes the payoffs will be -1 for each.

(b)  The  welfare  game.  This  game  models  a  government  that  wishes  to  aid  a  pauper  if 
he  searches  for  work  but  not  otherwise,  and  a  pauper  who  searches  for  work  only 
if  he  cannot  depend  on  government  aid,  and  who  may  not  succeed  in  finding 
a job even if he tries. The payoffs are 3, 2 (for government, pauper) if the
government aids and the pauper tries to work; -1, 1 if the government does
not aid and the pauper tries to work; -1, 3 if the government aids and the pauper
does not try to work; and 0, 0 in the remaining case. [These payoffs represent
the preferences of the players, rather than monetary values. For instance,
the government ranks the combination Aid/Search above the combination No
Aid/No Search, which in turn is ranked above the combinations Aid/No Search
and No Aid/Search, between which the government is indifferent.]

(c) Wage game. Each of two firms has one job opening. Suppose that firm $i$ ($i = 1, 2$)
offers wage $w_i$, where $0 < \tfrac{1}{2} w_1 < w_2 < 2w_1$ and $w_1 \neq w_2$. Imagine that
there  are  two  workers,  each  of  whom  can  apply  to  only  one  firm.  The  workers 
simultaneously  decide  whether  to  apply  to  firm  1  or  firm  2.  If  only  one  worker 
applies  to  a  given  firm,  that  worker  gets  the  job;  if  both  workers  apply  to  the 
same  firm,  each  worker  has  probability  1/2  of  getting  the  job  while  the  other 
worker  remains  unemployed  and  has  a  payoff  of  zero. 

(d)  Marketing  game.  Two  firms  sell  a  similar  product.  Each  percent  of  market  share 
yields a net payoff of 1. Without advertising both firms have 50 % of the market.
The  cost  of  advertising  is  equal  to  10  but  leads  to  an  increase  in  market  share  of 
20  %  at  the  expense  of  the  other  firm.  The  firms  make  their  advertising  decisions 
simultaneously  and  independently.  The  total  market  for  the  product  is  of  fixed 
size. 

(e) Voting game. Two political parties, I and II, each have three votes that they
can  distribute  over  three  party-candidates  each.  A  committee  is  to  be  elected, 
consisting  of  three  members.  Each  political  party  would  like  to  see  as  many  as 
possible  of  their  own  candidates  elected  in  the  committee.  Of  the  total  of  six 
candidates,  those  three  who  have  most  of  the  votes  will  be  elected;  in  case  of 
ties,  tied  candidates  are  drawn  with  equal  probabilities. 

(f)  Voting  game,  revisited.  Consider  the  situation  in  (e)  but  now  assume  that  each 
party  is  risk  averse.  For  instance,  each  party  strictly  prefers  to  have  one  candidate 
for  sure  in  the  committee  over  a  lottery  in  which  it  has  zero  or  two  candidates 
in the committee each with probability 50 %. Model this by each party having
a payoff of $\sqrt{c}$ for a sure number $c \in \{0, 1, 2, 3\}$ of its candidates in the
committee.

3.2.  Matrix  Games 

(a)  Since  a  matrix  game  is  a  special  case  of  a  bimatrix  game,  it  may  be  analyzed 
by  the  graphical  method  of  Sect.  3.2.2.  Do  this  for  the  game  in  Problem  2.1(a). 
Compare  your  answer  with  what  you  found  previously. 

(b)  Argue  that  a  pair  consisting  of  a  maximin  and  a  minimax  strategy  in  a  matrix 
game  is  a  Nash  equilibrium;  and  that  any  Nash  equilibrium  in  a  matrix  game 
must  be  a  pair  consisting  of  a  maximin  and  a  minimax  strategy.  (You  may  give 
all  your  arguments  in  words.) 

(c) Define a maximin strategy for player 1 in the bimatrix game (A, B) to be a
maximin  strategy  in  the  matrix  game  A.  Which  definition  is  appropriate  for 
player  2  in  this  respect?  With  these  definitions,  find  examples  showing  that 
a  Nash  equilibrium  in  a  bimatrix  game  does  not  have  to  consist  of  maximin 
strategies,  and  that  a  maximin  strategy  does  not  have  to  be  part  of  a  Nash 
equilibrium.

3.3. Strict Domination

Consider the bimatrix game

         W     X     Y     Z
   T   6,6   4,4   1,2   8,5
   B   4,5   6,6   2,8   4,4

(a)  Which  pure  strategy  of  player  1  or  player  2  is  strictly  dominated  by  a  pure 
strategy? 

(b)  Describe  all  combinations  of  strategies  W  and  Y  of  player  2  that  strictly 
dominate  X. 

(c)  Find  all  Nash  equilibria  of  this  game. 

3.4.  Iterated  Elimination  (1) 

Consider the bimatrix game

          W      X      Y      Z
   A    5,4    4,4    4,5   12,2
   B    3,7    8,7    5,8   10,6
   C   2,10    7,6    4,6    9,5
   D    4,4    5,9   4,10   10,9

(a)  Find  a  few  different  ways  in  which  strictly  dominated  strategies  can  be  iteratedly 
eliminated  in  this  game. 

(b)  Find  the  Nash  equilibria  of  this  game. 


3.5.  Iterated  Elimination  (2) 
Consider  the  bimatrix  game 


2,0 

3,4 

1,3 


1,1 

1,2 

0,2 


Find  the  Nash  equilibria  of  this  game. 


3.6.  Weakly  Dominated  Strategies 

A pure strategy $i$ of player 1 in an $m \times n$ bimatrix game $(A, B)$ is weakly dominated
if there is a strategy $p = (p_1, \ldots, p_m) \in \Delta^m$ such that $pAe^j \geq e^iAe^j$ for every
$j = 1, \ldots, n$, and $pAe^j > e^iAe^j$ for at least one $j$. The definition of a weakly
dominated strategy of player 2 is similar. In words, a pure strategy is weakly
dominated if there is some pure or mixed strategy that is always at least as good,
and that is better against at least one pure strategy of the opponent. Instead of
iterated elimination of strictly dominated strategies one might also consider iterated
elimination of weakly dominated strategies. The advantage is that in games where
no strategy is strictly dominated it might still be possible to eliminate strategies that
are weakly dominated. However, some Nash equilibria of the original game may be
eliminated as well, and also the order of elimination may matter. These issues are
illustrated by the following examples.

(a) Consider the bimatrix game

            X      Y      Z
   A    11,10    6,9   10,9
   B     11,6    6,6    9,6
   C    12,10    6,9   9,11

First,  determine  the  pure  Nash  equilibria  of  this  game.  Next,  apply  iterated 
elimination  of  weakly  dominated  strategies  to  reduce  the  game  to  a  2  x  2  game 
and  determine  the  unique  Nash  equilibrium  of  this  smaller  game. 

(b) Consider the bimatrix game

         X     Y     Z
   A   1,1   0,0   2,0
   B   1,2   1,2   1,1
   C   0,0   1,1   1,1

Show  that  different  orders  of  eliminating  weakly  dominated  strategies  may 
result  in  different  Nash  equilibria. 

3.7.  A  Parameter  Game 
Consider the bimatrix game

         L     R
   T   1,1   a,0
   B   0,0   2,1

where $a \in \mathbb{R}$. Determine the Nash equilibria of this game for every possible value
of $a$.

3.8.  Equalizing  Property  of  Mixed  Equilibrium  Strategies 

(a)  Consider  again  the  game  of  Problem  3.3,  which  has  a  unique  Nash  equilibrium 
in  mixed  strategies.  In  this  equilibrium,  player  1  puts  positive  probability  p*  on 
T and 1 - p* on B, and player 2 puts positive probability q* on W and 1 - q*
on Y. Show that, if player 2 plays this strategy, then both T and B give player
1 the same expected payoff, equal to the equilibrium payoff. Also show that, if
player 1 plays his equilibrium strategy, then both W and Y give player 2 the same
expected payoff, equal to the equilibrium payoff, and higher than the expected
payoff from X or from Z.

(b)  Generalize  the  observations  made  in  (a),  more  precisely,  give  an  argument  for 
the  following  statement: 

Let  (A,  B)  be  an  m  x  n  bimatrix  game  and  let  (p*,  q*)  be  a  Nash  equilibrium. 
Then  each  row  played  with  positive  probability  in  this  Nash  equilibrium  has  the 
same expected payoff for player 1 against q* and this payoff is at least as high
as the payoff from any other row. Each column played with positive probability
in this Nash equilibrium has the same expected payoff for player 2 against p*
and this payoff is at least as high as the payoff from any other column.

You  may  state  your  argument  in  words,  without  using  formulas. 

3.9.  Voting 

Suppose  the  spectrum  of  political  positions  is  described  by  the  closed  interval 
(line  segment)  [0,5].  Voters  are  uniformly  distributed  over  [0,5].  There  are  two 
candidates,  who  may  occupy  any  of  the  positions  in  the  set  {0,  1, 2,  3,  4,  5}.  Voters 
will  always  vote  for  the  nearest  candidate.  If  the  candidates  occupy  the  same  position 
they  each  get  half  of  the  votes.  The  candidates  simultaneously  and  independently 
choose  positions.  Each  candidate  wants  to  maximize  the  number  of  votes  for 
him/herself.  Only  pure  strategies  are  considered. 

(a)  Model  this  situation  as  a  bimatrix  game  between  the  two  candidates. 

(b)  Determine  the  best  replies  of  both  players. 

(c)  Determine  all  Nash  equilibria  (in  pure  strategies),  if  any. 

We  now  change  the  situation  as  follows.  Candidate  1  can  only  occupy  the  positions 
1,3,5,  and  candidate  2  can  only  occupy  the  positions  0,2,4. 

(d)  Answer  questions  (a),  (b),  and  (c)  for  this  new  situation. 

(e) The two games above are constant-sum games. How would you turn them
into  zero-sum  games  without  changing  the  strategic  possibilities  (best  replies, 
equilibria)?  What  would  be  the  value  of  these  games  and  the  (pure)  optimal 
strategies? 

3.10.  Guessing  Numbers 

Players 1 and 2 each choose a number from the set {1, ..., K}. If the players choose
the  same  number,  then  player  2  pays  1  Euro  to  player  1;  otherwise  no  payment 
is  made.  The  players  can  use  mixed  strategies  and  each  player’s  preferences  are 
determined  by  his  or  her  expected  monetary  payoff. 

(a) Show that each player playing each pure strategy with probability 1/K is a Nash
equilibrium. 

(b)  Show  that  in  every  Nash  equilibrium,  player  1  must  choose  every  number  with 
positive  probability. 

(c)  Show  that  in  every  Nash  equilibrium,  player  2  must  choose  every  number  with 
positive probability.

(d) Determine all (mixed strategy) Nash equilibria of the game.

(e)  This  is  a  zero-sum  game.  What  are  the  optimal  strategies  and  the  value  of  the 
game? 

3.11.  Bimatrix  Games 


(a)  Give  an  example  of  a  2  x  2-bimatrix  game  with  exactly  two  Nash  equilibria  in 
pure  strategies  and  no  other  Nash  equilibrium.  For  your  example,  determine  the 
players’  reaction  functions  and  make  a  picture,  showing  that  your  example  is  as 
desired. 

(b)  Consider  the  following  bimatrix  game: 


$$(A, B) = \begin{pmatrix} a,b & c,d \\ e,f & g,h \end{pmatrix}.$$


Assume  that  a  >  e.  Give  necessary  and  sufficient  further  conditions  on  the 
payoffs  such  that  the  game  has  no  pure  strategy  Nash  equilibria.  Under  these 
conditions,  determine  all  mixed  Nash  equilibria  of  the  game. 


3.4  Notes 

For  finite  two-person  games,  Nash  (1951)  proved  that  every  game  has  a  Nash 
equilibrium  in  mixed  strategies.  The  term  ‘strategic  uncertainty’  can  already  be 
found  in  von  Neumann  and  Morgenstern  (1944/1947).  For  general  bimatrix  games, 
Nash  equilibria  can  be  found  by  Nonlinear  Programming  methods,  see  Chap.  13. 

Problems  3.1  (a,  b)  are  taken  from  Rasmusen  (1989),  and  Problem  3.1(c)  from 
Gibbons (1992). Problem 3.4 is taken from Watson (2002).

Finite Extensive Form Games


Most  games  derived  from  economic  or  political  situations  have  in  common  with 
most  parlor  games  (like  card  games  and  board  games)  that  they  are  not  ‘one-shot’: 
players  move  sequentially,  and  one  and  the  same  player  may  move  more  often  than 
once.  Such  games  are  best  described  by  drawing  a  decision  tree  which  tells  us  whose 
move  it  is  and  what  a  player’s  information  is  when  that  player  has  to  make  a  move. 

In  this  chapter  these  ‘games  in  extensive  form’  are  studied.  Attention  is  restricted 
to  games  with  finitely  many  players  (usually  two),  finitely  many  decision  moments 
and  finitely  many  moves.  See  Sect.  1.3.3  for  a  few  examples.  We  also  assume  that 
each  player  has  ‘complete  information’:  this  means  that  either  there  is  no  chance 
move  in  the  game  or,  if  there  is  one,  there  is  no  player  who  still  has  to  move  after 
the  chance  move  and  is  not  informed  about  the  outcome  of  the  chance  move.  This 
excludes,  for  instance,  the  game  of  entry  deterrence  with  incomplete  information  in 
Sect.  1.3.3,  but  it  also  excludes  most  card  games — most  card  games  start  with  cards 
being  dealt,  which  is  a  chance  move.  For  the  analysis  of  games  with  incomplete 
information  see  Chap.  5.  Chapter  14  extends  the  analysis  of  the  present  and  the  next 
chapter. 

The  first  section  of  this  chapter  introduces  games  in  extensive  form.  In  order 
to  avoid  a  load  of  cumbersome  notation  the  treatment  will  be  somewhat  informal 
but — hopefully — not  imprecise.  In  Sect.  4.2  we  define  strategies  and  the  ‘strategic 
form’  of  a  game:  the  definition  of  Nash  equilibrium  for  extensive  form  games  is  then 
practically  implied.  The  focus  in  this  chapter  is  on  pure  Nash  equilibrium. 

In  the  third  section  the  concept  of  Nash  equilibrium  is  refined  by  considering 
backward  induction  and  subgame  perfection.  A  further  important  refinement,  called 
‘perfect Bayesian equilibrium’, is treated in the fourth section.

4.1 The Extensive Form

A  game  in  extensive  form  is  described  by  a  game  tree.  Such  a  game  tree  is 
characterized  by  nodes  and  edges.  Each  node  is  either  a  decision  node  of  a  player, 
or  a  chance  node ,  or  an  end  node.  Each  edge  corresponds  to  either  an  action  of  a 
player  or  a  choice  made  by  chance,  sometimes  called  a  ‘move  of  Nature’. 

Figure  4.1  illustrates  these  and  other  concepts. 

The  upper  node  in  the  tree,  the  root  of  the  tree,  is  a  decision  node  of  player  1  and 
the  starting  point  of  the  game.  Player  1  chooses  between  three  actions,  namely  A, 
B, and C. Player 2 learns that player 1 has chosen either one of the actions A and B,
or action C. The first event is indicated by the dashed line connecting the two left
decision nodes of player 2. In that case, player 2 has two actions, namely l and r. We
call  the  two  connected  nodes  an  information  set  of  player  2:  player  2  knows  that  the 
play  of  the  game  has  arrived  at  one  of  these  nodes  but  he  does  not  know  at  which 
one.  The  fact  that  player  2  has  the  same  set  of  actions  at  each  of  the  two  nodes  in  this 
information  set  is  a  necessary  consequence:  if  this  were  not  the  case,  player  2  would 
know  at  which  node  he  was  (i.e.,  would  know  whether  player  1  would  have  played 
A  or  B)  by  simply  examining  the  set  of  available  actions,  which  would  go  against  the 
interpretation  of  an  information  set.  This  last  argument  is  one  consequence  of  the 
more  general  assumption  that  the  whole  game  tree  is  common  knowledge  between 
the  players:  each  player  knows  it,  knows  that  the  other  player(s)  know(s)  it,  knows 
that  the  other  player(s)  know(s)  that  he  knows  it,  and  so  on. 

If  player  1  plays  C,  then  there  is  a  chance  move,  resulting  with  probability  1/4 
in  a  decision  node  of  player  2  (following  U)  and  with  probability  3/4  in  a  decision 
node of player 1 (following D). At player 2’s decision node this player has two
actions, namely L and R. At player 1’s decision node this player also has two actions,
namely  a  and  b.  All  the  remaining  nodes  are  end  nodes,  indicated  by  payoff  pairs, 
where  the  upper  number  is  the  payoff  to  player  1  and  the  lower  number  the  payoff 
to  player  2.  In  this  diagram,  the  payoffs  are  written  as  column  vectors,  but  we  also 
write  them  as  row  vectors,  whatever  is  convenient  in  a  given  situation. 


Fig. 4.1 A game in extensive form

Fig. 4.2 The game of Fig. 4.1, now with perfect information


Also  the  singleton  decision  nodes  are  called  information  sets.  Thus,  in  this  game, 
each  player  has  two  different  information  sets.  An  information  set  is  nontrivial  if  it 
consists  of  at  least  two  nodes.  Games  with  nontrivial  information  sets  are  called 
games  with  imperfect  information.  If  a  game  has  only  trivial  information  sets, 
then we say that it has perfect information. If we change the present example by
assuming that player 2 observes whether player 1 chooses A or B, then the game
has  perfect  information.  See  Fig.  4.2.  We  have  renamed  the  actions  of  player  2  after 
action  B  of  player  1  for  later  convenience:  these  actions  can  now  be  regarded  as 
different  from  player  2’s  actions  following  A,  since  player  2  knows  that  player  1  has 
chosen  B. 
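The distinction between perfect and imperfect information can be phrased as a simple test on the information sets. In the sketch below the node labels (such as "after A") are hypothetical names introduced here for the decision nodes described in the text; they do not appear in the figures.

```python
# Information sets per player, each given as a list of decision nodes.
info_sets_fig_4_1 = {
    1: [["root"], ["after C, D"]],
    2: [["after A", "after B"], ["after C, U"]],
}
info_sets_fig_4_2 = {
    1: [["root"], ["after C, D"]],
    2: [["after A"], ["after B"], ["after C, U"]],
}

def has_perfect_information(info_sets):
    """A game has perfect information if every information set is trivial,
    i.e., consists of a single decision node."""
    return all(len(s) == 1 for sets in info_sets.values() for s in sets)

print(has_perfect_information(info_sets_fig_4_1))
print(has_perfect_information(info_sets_fig_4_2))
```

The only difference between the two descriptions is that player 2's nontrivial information set in Fig. 4.1 is split into two singletons in Fig. 4.2.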

The  chance  move  in  our  example  is  not  a  particularly  interesting  one,  since  the 
players  learn  what  the  outcome  of  the  chance  move  is.  (The  situation  is  different  if 
at  least  one  player  is  not  completely  informed  about  the  outcome  of  a  chance  move 
and  if  this  lack  of  information  has  strategic  consequences.  In  that  case,  we  talk  about 
games  with  ‘incomplete’  information,  see  Chap.  5.) 

As mentioned before, we do not give a formal definition of a game in extensive
form:¹ the examples in Figs. 4.1 and 4.2 illustrate the main ingredients of such a
game.  An  important  condition  is  that  the  game  tree  should  be  a  tree  indeed:  it 
should  have  a  single  root  and  no  ‘cycles’.  This  means  that  a  situation  as  for  instance 
in  Fig.  4.3a  is  not  allowed. 

We also restrict attention to games in extensive form that have perfect recall: each
player remembers what he did in the past. For instance, the situation in Fig. 4.3b,
where player 1 at his lower information set does not recall which action he took
earlier, is not considered.²

¹ See Chap. 14 for a formal definition.

² The assumption of perfect recall plays a particular role for the relation between mixed and
behavioral strategies, see Chap. 14.

Fig. 4.3 An example of a cycle (a), and of a game without perfect recall (b)


4.2  The  Strategic  Form 

In  a  game  in  extensive  form,  it  is  extremely  important  to  distinguish  between  actions 
and  strategies.  An  action  is  a  possible  move  of  a  player  at  an  information  set.  In  the 
games in Figs. 4.1 and 4.2 player 1 has the actions A, B, and C, and a and b; and
player 2 has the actions l and r, and L and R. In contrast,

a  strategy  is  a  complete  plan  to  play  the  game. 

This  is  one  of  the  most  important  concepts  in  game  theory.  In  the  games  in  Figs.  4.1 
and  4.2,  a  possible  strategy  for  player  1  is: 

Start by playing C; if the chance move of the game results in D, then play b.

Another  strategy  of  player  1  is: 

Start  by  playing  A;  if  the  chance  move  of  the  game  results  in  D,  then  play  b. 

The last strategy might look strange since player 1’s first action A precludes him
having to take any further action. Nevertheless, this plan is also regarded as a
possible strategy.³

³ Although not much is lost if we exclude such strategies, as some authors do.

A possible strategy for player 2 in the game of Fig. 4.1 is:

Play l if player 1 plays A or B, and play L if player 1 plays C and the chance
move results in U.

Note that player 2 cannot make his action contingent on whether player 1 plays A
or B, since player 2 does not have that information. In the perfect information game
of Fig. 4.2, however, player 2’s strategy should tell what player 2 plays after A and
what he plays after B. A possible strategy would then be:

Play l if player 1 plays A, play r' if player 1 plays B, and play L if player 1
plays C and the chance move results in U.

A  formal  definition  of  a  strategy  of  a  player  is: 

a  strategy  is  a  list  of  actions,  exactly  one  at  each  information  set  of  that  player. 

In  both  our  examples,  a  strategy  of  player  1  is  therefore  a  list  of  two  actions  since 
player  1  has  two  information  sets.  The  number  of  possible  strategies  of  player  1  is 
the  number  of  different  lists  of  actions.  Since  player  1  has  three  possible  actions  at 
his  first  information  set  and  two  possible  actions  at  his  second  information  set,  this 
number is equal to 3 × 2 = 6. The strategy set of player 1 can be denoted as

{Aa, Ab, Ba, Bb, Ca, Cb} .

Similarly, in the imperfect information game in Fig. 4.1 player 2 has 2 × 2 = 4
different strategies, and his strategy set can be denoted as

{lL, lR, rL, rR} .

In the perfect information game in Fig. 4.2 player 2 has three information sets and
two actions at each information set, so 2 × 2 × 2 = 8 different strategies, and his
strategy set can be denoted as

{ll'L, ll'R, lr'L, lr'R, rl'L, rl'R, rr'L, rr'R} .
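Since a strategy picks exactly one action at each information set, the strategy sets are Cartesian products of the action sets. The following Python sketch enumerates them; the variable names are chosen here for illustration.

```python
from itertools import product

# Action sets, one list per information set of the player.
actions_p1 = [["A", "B", "C"], ["a", "b"]]
actions_p2_fig_4_1 = [["l", "r"], ["L", "R"]]
actions_p2_fig_4_2 = [["l", "r"], ["l'", "r'"], ["L", "R"]]

def strategies(action_sets):
    """All lists of actions, exactly one per information set, written as strings."""
    return ["".join(choice) for choice in product(*action_sets)]

for sets in (actions_p1, actions_p2_fig_4_1, actions_p2_fig_4_2):
    s = strategies(sets)
    print(len(s), s)
```

The number of strategies is the product of the numbers of actions at the player's information sets, which is why splitting one information set of player 2 into two doubles his number of strategies from 4 to 8.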

In  general,  it  is  important  to  distinguish  between  a  strategy  combination  and  the 
associated  outcome.  The  outcome  is  the  induced  play  of  the  game,  that  is,  the  path 
followed in the game tree. For instance, the strategy combination (Aa, ll'L) in the
game in Fig. 4.2 induces the outcome (A, l) and the payoffs (4, 1).

There  are  several  reasons  why  we  are  interested  in  strategies.  The  main  reason 
is  that  by  considering  strategies  the  extensive  form  game  is  effectively  reduced  to  a 
one-shot  game.  Once  we  fix  a  profile  (in  the  present  example,  pair)  of  strategies  we 
can compute the payoffs by tracing the induced path in the game tree. Consider
for instance the strategy pair (Cb, rL) in the game in Fig. 4.1. Then player 1 starts by
playing C, and this is followed by a chance move; if the result of this move is U, then
player 2 plays L; if the result is D, then player 1 plays b. Hence, with probability
1/4 the resulting payoff pair is (0, 0) and with probability 3/4 the resulting payoff
pair is (0, 8). So the expected payoffs are 0 for player 1 and (1/4)·0 + (3/4)·8 = 6
for player 2. In this way, we can compute the payoffs in the game of Fig. 4.1 resulting
from each of the 6 × 4 possible strategy combinations. Similarly, for the game in
Fig. 4.2 we compute 6 × 8 payoff pairs. We next write these payoff pairs in a bimatrix,
as in Chap. 3. The resulting bimatrix games are presented in Fig. 4.4.

Fig. 4.4 The 6 × 4 strategic form of the game in Fig. 4.1 and the 6 × 8 strategic form of the game
in Fig. 4.2
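The expected payoff computation for the strategy pair (Cb, rL) can be written out as a short Python sketch. It uses only the chance probabilities and end-node payoffs stated in the text; the function name `expected_payoffs` is introduced here.

```python
from fractions import Fraction as F

# Chance move after C: U with probability 1/4, D with probability 3/4.
chance = {"U": F(1, 4), "D": F(3, 4)}

# Payoff pairs reached under (Cb, rL): after U player 2 plays L,
# after D player 1 plays b.
payoff_after = {"U": (0, 0), "D": (0, 8)}

def expected_payoffs(chance, payoff_after):
    """Probability-weighted average of the payoff pairs, per player."""
    return tuple(
        sum(prob * payoff_after[event][k] for event, prob in chance.items())
        for k in range(2)
    )

print(expected_payoffs(chance, payoff_after))
```

Repeating this for every strategy pair fills in one cell of the strategic form at a time.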

Such  a  bimatrix  game  is  called  the  strategic  form  of  the  extensive  form  game.  The 
definition  of  Nash  equilibrium  of  an  extensive  form  game  is  then  almost  implied: 

A  Nash  equilibrium  of  a  game  in  extensive  form  is  a  Nash  equilibrium  of  the 
strategic  form. 

This  definition  holds  for  pure  Nash  equilibria  and,  more  generally,  Nash  equilibria 
in  mixed  strategies,  but  in  this  chapter  we  restrict  attention  to  pure  strategies  and 
pure  strategy  Nash  equilibria. 

The  pure  strategy  Nash  equilibria  of  the  bimatrix  games  in  Fig.  4.4  can  be 
found  by  using  the  method  of  Sect.  3.2.1.  The  equilibria  correspond  to  the  double 
underlined  entries.  Thus,  the  imperfect  information  game  has  six  different  Nash 
equilibria  in  pure  strategies,  and  the  perfect  information  game  has  ten  different  Nash 
equilibria  in  pure  strategies. 
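The search for pure strategy Nash equilibria amounts to finding the cells in which both payoffs are best replies. The sketch below is one possible implementation of that idea; the function name `pure_nash` is introduced here. As a small illustration it is applied to the 3 × 4 bimatrix game analysed in Chap. 3 rather than to the larger games of Fig. 4.4.

```python
def pure_nash(A, B, rows, cols):
    """Return all (row, column) pairs at which player 1's payoff is maximal
    in its column of A and player 2's payoff is maximal in its row of B."""
    found = []
    for i, r in enumerate(rows):
        for j, c in enumerate(cols):
            best_for_1 = A[i][j] == max(A[k][j] for k in range(len(rows)))
            best_for_2 = B[i][j] == max(B[i])
            if best_for_1 and best_for_2:
                found.append((r, c))
    return found

# The 3 x 4 example of Chap. 3 (rows T, M, B; columns W, X, Y, Z).
A = [[2, 2, 2, 0], [1, 4, 2, 1], [0, 3, 3, 3]]  # player 1
B = [[2, 1, 2, 0], [0, 1, 4, 5], [4, 1, 0, 3]]  # player 2

print(pure_nash(A, B, ["T", "M", "B"], ["W", "X", "Y", "Z"]))
```

For that game the only such cell is (T, W), which is the member with q = 1 of the equilibrium set found in Chap. 3.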

In  the  next  two  sections  we  examine  these  Nash  equilibria  more  closely  and 
discuss  ways  to  distinguish  between  them. 

4.3  Backward  Induction  and  Subgame  Perfection 

We  first  consider  the  perfect  information  game  of  Fig.  4.2.  This  game  can  be 
analyzed  using  the  principle  of  backward  induction.  This  means  that  we  start  with 
the  nodes  preceding  the  end  nodes,  and  turn  them  into  end  nodes  with  payoffs 
resulting from choosing the optimal action(s). Specifically, player 2 chooses r after
A of player 1 and l' after B of player 1; and R after C and U. Player 1 chooses a after
D. Thus, we obtain the reduced game of Fig. 4.5. Note that player 2’s strategy has
already been completely determined by this first step: it is the strategy rl'R. Player
1 has chosen a at his lower information set. Next, in this reduced game, player 1
chooses the action(s) that yield(s) the highest payoff. Since A yields a payoff of 7, B
a payoff of 6, and C a(n expected) payoff of (1/4)·8 + (3/4)·8 = 8, it is optimal for player
1 to choose C. Hence, we obtain the strategy combination (Ca, rl'R) with payoffs
(1/4)(8, 4) + (3/4)(8, 8) = (8, 7). This is one of the ten Nash equilibria of the game (see
Fig. 4.4). It is called the backward induction equilibrium.

Fig. 4.5 The reduced game of Fig. 4.2
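The backward induction steps can be sketched in Python. The tree encoding and the function `solve` are assumptions made here for illustration. Only the subgame starting at the chance move is encoded in full, because its four payoff pairs are stated in the text; for the actions A and B only player 1's payoffs in the reduced game (7 and 6) are used, since player 2's payoffs there are not stated in the text.

```python
from fractions import Fraction as F

# Node encodings (hypothetical): ("leaf", payoffs),
# ("move", player, {action: child}), ("chance", {event: (probability, child)}).
chance_subgame = ("chance", {
    "U": (F(1, 4), ("move", 2, {"L": ("leaf", (0, 0)), "R": ("leaf", (8, 4))})),
    "D": (F(3, 4), ("move", 1, {"a": ("leaf", (8, 8)), "b": ("leaf", (0, 8))})),
})

def solve(node):
    """Backward induction: return (payoffs, plan), where plan maps each
    decision node, identified by (player, available actions), to the chosen
    action. Ties are broken in favour of the first action listed."""
    kind = node[0]
    if kind == "leaf":
        return node[1], {}
    if kind == "chance":
        total, plan = [0, 0], {}
        for prob, child in node[1].values():
            pay, sub = solve(child)
            total = [t + prob * x for t, x in zip(total, pay)]
            plan.update(sub)
        return tuple(total), plan
    _, player, moves = node
    best_action, best_pay, plan = None, None, {}
    for action, child in moves.items():
        pay, sub = solve(child)
        plan.update(sub)
        if best_pay is None or pay[player - 1] > best_pay[player - 1]:
            best_action, best_pay = action, pay
    plan[(player, tuple(moves))] = best_action
    return best_pay, plan

value, plan = solve(chance_subgame)

# Player 1's payoffs at the root of the reduced game.
root = {"A": 7, "B": 6, "C": value[0]}
root_choice = max(root, key=root.get)
print(value, plan, root_choice)
```

The sketch reproduces the choices R and a below the chance move, the expected payoff pair (8, 7) after C, and the choice of C at the root.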

It  can  be  shown  that,  in  a  game  of  perfect  information,  applying  the  backward 
induction  principle  always  results  in  a  (pure)  Nash  equilibrium.  As  a  by-product, 
we  obtain  that  a  game  of  perfect  information  has  at  least  one  Nash  equilibrium  in 
pure  strategies,  which  can  be  obtained  by  backward  induction. 

It is illustrative to compare the backward induction equilibrium, which in this game
is (Ca, rl'R), with the induced backward induction outcome. The latter
refers to the actual play of the game or, equivalently, the equilibrium path, in this
case (Ca, R). Observe that there are other Nash equilibria in this game that generate
the same outcome or path, namely (Ca, ll'R), (Ca, lr'R), and (Ca, rr'R): they all
generate  the  path  (Ca,  R),  but  differ  in  the  left  part  of  the  tree,  where  player  2  makes 
at  least  one  decision  that  is  not  optimal.  Hence,  the  principle  of  backward  induction 
ensures  that  every  player  always  takes  an  optimal  action,  even  in  parts  of  the  game 
tree  that  are  not  actually  reached  when  the  game  is  played. 
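The difference between a strategy and the path it induces can be made concrete with a short enumeration. The sketch below lists the strategies of player 2 in the game of Fig. 4.2 that prescribe R at the information set reached after C and U; it does not check the equilibrium property, for which the payoffs of Fig. 4.4 would be needed. The helper name `on_path_action` is introduced here.

```python
from itertools import product

# Player 2's strategies in Fig. 4.2: one action after A, after B, and after C and U.
p2_strategies = list(product(["l", "r"], ["l'", "r'"], ["L", "R"]))

def on_path_action(strategy):
    """When player 1 starts with C, only the third component is ever played."""
    return strategy[2]

same_path = ["".join(s) for s in p2_strategies if on_path_action(s) == "R"]
print(same_path)
```

The four strategies found agree on the path after C and differ only at the nodes after A and B, which are not reached; rl'R is the one among them that is also optimal there.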

A  generalization  of  backward  induction  is  subgame  perfection.  The  definition  of 
a  subgame  is  as  follows: 

A  subgame  is  any  part  of  the  game  tree,  starting  at  an  information  set 
consisting  of  a  single  decision  node  (trivial  information  set)  of  a  player  or 
at  a  chance  node,  and  which  is  not  connected  to  the  rest  of  the  tree  by  any 
later  information  set. 

The  last  part  of  this  definition  is  illustrated  in  Fig.  4.6.  Although  player  2’s 
information  set  consists  of  a  single  decision  node,  no  subgame  starts  there:  this 
would  not  make  sense  since  player  3  does  not  observe  any  earlier  move  and  in 
particular  does  not  know  at  which  of  the  three  nodes  in  his  information  set  he  is. 
Formally,  the  subtree  starting  at  player  2’s  decision  node  is  still  connected  to  the 
whole tree by player 3’s information set.

The game in Fig. 4.2 has six different subgames, namely: the entire game; the
game  starting  from  the  chance  move;  and  the  four  games  starting  from  the  four 
nodes  preceding  the  end  nodes.  The  definition  of  a  subgame  perfect  equilibrium  is 
as  follows: 

A subgame perfect equilibrium is a strategy combination that induces a Nash
equilibrium in every subgame.

To  see  what  this  means,  consider  again  the  game  in  Fig.  4.2.  In  order  for  a 
strategy  combination  to  be  a  subgame  perfect  equilibrium,  it  has  to  induce  a  Nash 
equilibrium  in  every  subgame.  Since  the  entire  game  is  a  subgame,  a  subgame 
perfect  equilibrium  has  to  be  a  Nash  equilibrium  in  the  entire  game,  and,  thus,  the  ten 
Nash  equilibria  in  this  game  are  the  candidates  for  a  subgame  perfect  equilibrium. 
This  is  the  case  for  any  arbitrary  game,  and  therefore  a  subgame  perfect  equilibrium 
is  always  a  Nash  equilibrium. 

Returning to the game in Fig. 4.2, a subgame perfect equilibrium also has to
induce an equilibrium in each of the four one-player subgames preceding the end
nodes: although we have not defined Nash equilibria for one-person games, the only
reasonable definition is that a player should choose the action that is optimal. In the
example, this means that (from left to right) the actions r, l', R, and a should be
chosen. This implies that the players choose optimally also in the subgame starting
from the chance node. Summarizing, we look for the Nash equilibrium or equilibria
that generate the mentioned actions, and the only Nash equilibrium that does this
is again (Ca, rl'R). Hence, the unique subgame perfect equilibrium in this game is
(Ca, rl'R). It is not surprising that this is also the backward induction equilibrium: in
games of perfect information, backward induction equilibria and subgame perfect
equilibria coincide.

Let us now consider the imperfect information version of the game in Fig. 4.1.
In this case, backward induction cannot be applied to the left part of the game tree:
since player 2 does not know whether player 1 has played A or B when he has to
choose an action in his left information set, he cannot optimally choose between l
and r: l is better if player 1 has played B, but r is better if player 1 has played A.
Considering subgame perfection, the only subgames are now: the entire game; the
two subgames following U and D; and the subgame starting from the chance move.
Hence, the restrictions imposed by subgame perfection are that player 1 should play
a, player 2 should play R, and the strategy combination should be a Nash equilibrium
in the entire game. Of the six Nash equilibria of the game (see Fig. 4.4), this leaves
the two equilibria (Ca, lR) and (Ca, rR). So these are the subgame perfect equilibria
of the game in Fig. 4.1.

Fig. 4.7 A three-player game

We  conclude  this  section  with  an  example  which  shows  more  explicitly  than  the 
preceding  example  that  subgame  perfection  can  be  more  generally  applied  than  the 
backward induction principle. Consider the game in Fig. 4.7, which is a three-player
game  (for  a  change).  Clearly,  backward  induction  cannot  be  applied  here:  player  3 
cannot  unambiguously  determine  his  optimal  action  since  he  does  not  know  whether 
player  2  has  played  L  or  has  played  R.  For  subgame  perfection,  notice  that  this  game 
has  two  subgames:  the  entire  game;  and  the  game  starting  with  player  2’s  decision 
node.  The  latter  game  is  a  game  between  players  2  and  3  with  strategic  form 

\[
\begin{array}{c|cc}
  & l & r \\ \hline
L & 3,1 & 0,0 \\
R & 0,0 & 1,3
\end{array}
\]

where player 2 is the row player and player 3 the column player. This game has two
pure Nash equilibria, namely (L, l) and (R, r). Hence, a subgame perfect equilibrium
has to induce one of these two equilibria in this subgame. Note that if the first
equilibrium is played, then player 1 should play A, yielding him a payoff of 3 rather
than the payoff of 2 obtained by playing B. If the other equilibrium is played in the
subgame, then player 1 should obviously play B since A now yields only 1. So the
two subgame perfect equilibria are (A, L, l) and (B, R, r).

Alternatively,  one  can  first  compute  the  (pure)  Nash  equilibria  of  the  entire  game. 
The  strategic  form  of  the  game  can  be  represented  as  follows,  where  the  left  matrix 
results  from  player  1  playing  A  and  the  right  matrix  from  player  1  playing  B. 
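\[
A:\quad
\begin{array}{c|cc}
  & l & r \\ \hline
L & \underline{3},\underline{3},\underline{1} & 0,0,0 \\
R & 0,0,0 & 1,\underline{1},\underline{3}
\end{array}
\qquad
B:\quad
\begin{array}{c|cc}
  & l & r \\ \hline
L & 2,\underline{0},\underline{0} & \underline{2},\underline{0},\underline{0} \\
R & \underline{2},\underline{0},\underline{0} & \underline{2},\underline{0},\underline{0}
\end{array}
\]

Best replies are underlined (for player 1 one has to compare the corresponding
payoffs over the two matrices), and the pure Nash equilibria are (A, L, l), (B, L, r),
(B, R, l), and (B, R, r). The subgame perfect equilibria are those where the
combination (L, l) or (R, r) is played, resulting in the two equilibria found above.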
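
The two routes described above can be checked mechanically. The following is a minimal sketch, not part of the original exposition: it assumes the payoffs of Fig. 4.7 as they appear in the strategic form above, and the names `PAYOFFS`, `is_nash`, `pure_nash`, and `subgame_perfect` are introduced here purely for illustration.

```python
from itertools import product

# Payoffs (player 1, player 2, player 3), read off the strategic form in the text.
PAYOFFS = {
    ("A", "L", "l"): (3, 3, 1),
    ("A", "L", "r"): (0, 0, 0),
    ("A", "R", "l"): (0, 0, 0),
    ("A", "R", "r"): (1, 1, 3),
}
# After B the game ends with (2, 0, 0), whatever players 2 and 3 planned to do.
for s2, s3 in product("LR", "lr"):
    PAYOFFS[("B", s2, s3)] = (2, 0, 0)

ACTIONS = [("A", "B"), ("L", "R"), ("l", "r")]


def is_nash(profile, players):
    """True if none of the listed players gains by a unilateral deviation."""
    for i in players:
        for deviation in ACTIONS[i]:
            alternative = list(profile)
            alternative[i] = deviation
            if PAYOFFS[tuple(alternative)][i] > PAYOFFS[profile][i]:
                return False
    return True


def pure_nash():
    """All pure Nash equilibria of the entire game."""
    return [p for p in product(*ACTIONS) if is_nash(p, players=(0, 1, 2))]


def subgame_perfect():
    """Nash equilibria that also induce an equilibrium in the subgame after A."""
    result = []
    for p in pure_nash():
        induced = ("A",) + p[1:]  # play of players 2 and 3 once the subgame is reached
        if is_nash(induced, players=(1, 2)):
            result.append(p)
    return result


print(pure_nash())
print(subgame_perfect())
```

Under these assumptions the first list contains the four pure Nash equilibria and the second the two subgame perfect equilibria found in the text.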


4.4  Perfect  Bayesian  Equilibrium 

A  further  refinement  of  Nash  equilibrium  and  of  subgame  perfect  equilibrium  is 
provided  by  the  concept  of  ‘perfect  Bayesian  equilibrium’.  Consider  an  information 
set  of  a  player  in  an  extensive  form  game.  A  belief  of  that  player  on  that  information 
set  is  a  probability  distribution  over  the  nodes  of  that  information  set  or,  equivalently, 
over  the  actions  leading  to  that  information  set.  Of  course,  if  the  information  set  is 
trivial  (consists  of  a  single  node)  then  also  the  belief  is  trivial,  namely  attaching 
probability  1  to  the  unique  node.  Our  somewhat  informal  definition  of  a  perfect 
Bayesian  equilibrium  is  as  follows. 

A  perfect  Bayesian  equilibrium  in  an  extensive  form  game  is  a  combination  of 
strategies  and  a  specification  of  beliefs  such  that  the  following  two  conditions 
are  satisfied: 

(i)  Bayesian  consistency:  the  beliefs  are  consistent  with  the  strategies  under 
consideration; 

(ii)  sequential  rationality:  the  players  choose  optimally  given  their  beliefs. 

The  first  condition  says  that  the  beliefs  should  satisfy  Bayesian  updating  with 
respect  to  the  strategies  whenever  possible.  The  second  condition  says  that  a 
player  should  maximize  his  expected  payoff  given  his  beliefs.  In  order  to  see  what 
these  conditions  mean  exactly,  we  consider  some  examples.  (Formal  definitions  are 
provided  in  Chap.  14.) 

Consider the game in Fig. 4.8, which is identical to the game in Fig. 4.1.
This game has one nontrivial information set. Suppose player 2's belief at this
information set is given by the probabilities $\alpha$ at the left node and $1-\alpha$ at the
right node, where $0 \le \alpha \le 1$. That is, if this information set is reached then player
2 attaches probability $\alpha$ to player 1 having played A and probability $1-\alpha$ to player
1 having played B. All the other information sets are trivial and therefore the beliefs
attach probability 1 to each of the corresponding nodes. Condition (ii), sequential
rationality, means that player 2 should choose R and player 1 should choose a at the
corresponding (trivial) information sets. At the nontrivial information set, player 2
should choose the action that maximizes his expected payoff, given his belief at this
information set. The expected payoff from l is equal to $\alpha \cdot 1 + (1-\alpha) \cdot 3 = 3 - 2\alpha$
and the expected payoff from r is $\alpha \cdot 2 + (1-\alpha) \cdot 0 = 2\alpha$. Hence, l is optimal if
$3 - 2\alpha \ge 2\alpha$, i.e., if $\alpha \le 3/4$, and r is optimal if $\alpha \ge 3/4$.
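
As an illustration of sequential rationality at this information set, the threshold 3/4 can be recovered numerically. This is only a sketch: the dictionary `U2` collects player 2's payoffs as used in the computation above, and the function names are hypothetical.

```python
from fractions import Fraction

# Player 2's payoffs at the nontrivial information set, keyed by
# (action of player 1, action of player 2), as used in the text.
U2 = {("A", "l"): 1, ("B", "l"): 3, ("A", "r"): 2, ("B", "r"): 0}


def expected_payoff(action, alpha):
    """Expected payoff of player 2 when he believes A with probability alpha."""
    return alpha * U2[("A", action)] + (1 - alpha) * U2[("B", action)]


def best_actions(alpha):
    """All actions of player 2 that maximize his expected payoff given alpha."""
    values = {a: expected_payoff(a, alpha) for a in ("l", "r")}
    top = max(values.values())
    return [a for a in ("l", "r") if values[a] == top]


print(best_actions(Fraction(1, 2)))  # belief below the threshold
print(best_actions(Fraction(3, 4)))  # belief exactly at the threshold
print(best_actions(Fraction(1)))     # belief above the threshold
```

Exact fractions are used so that the tie at the threshold is detected without rounding issues.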
Fig. 4.8 The extensive form game of Fig. 4.1


What does condition (i), Bayesian consistency, imply for this game? It says that,
whenever possible, the belief of player 2 at the left, nontrivial information set should
be computed by Bayesian updating using the strategy of player 1, specifically, using
the initial action of player 1. Thus, if the strategy of player 1 prescribes action A,
then player 2 should indeed believe this, so $\alpha = 1$. Similarly, if the strategy of
player 1 prescribes action B, then $1 - \alpha = 1$, so $\alpha = 0$. The formula behind this is
the formula for conditional probability:

\[
\alpha = \mathrm{Prob}[A \text{ is played} \mid A \text{ or } B \text{ is played}]
= \frac{\mathrm{Prob}[A \text{ and } (A \text{ or } B) \text{ is played}]}{\mathrm{Prob}[A \text{ or } B \text{ is played}]}
= \frac{\mathrm{Prob}[A \text{ is played}]}{\mathrm{Prob}[A \text{ or } B \text{ is played}]}
\]

where these probabilities should be computed given the strategy of player 1. Hence,
indeed, if the strategy of player 1 prescribes A then $\alpha = 1/1 = 1$ and if the
strategy of player 1 prescribes B then $\alpha = 0/1 = 0$. If, however, player 1's strategy
prescribes C, then Prob[A or B is played] = 0 and we cannot use the formula for
computing $\alpha$: the belief of player 2 on his nontrivial information set is undetermined
or free.
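
The conditional probability formula can be written as a small function. This is a sketch under the assumption that player 1's behavior at the root is summarized by the probabilities of the initial actions A, B, and C; the name `belief_on_A` is hypothetical, and `None` is used here to signal a free belief.

```python
from fractions import Fraction


def belief_on_A(initial_action_probs):
    """Belief alpha of player 2 that A was played, given his information set is reached.

    initial_action_probs maps 'A', 'B', 'C' to the probabilities with which
    player 1 chooses them. Returns None when the information set is reached
    with probability zero, in which case the belief is free.
    """
    prob_A = Fraction(initial_action_probs.get("A", 0))
    prob_B = Fraction(initial_action_probs.get("B", 0))
    reach = prob_A + prob_B  # Prob[A or B is played]
    if reach == 0:
        return None
    return prob_A / reach


print(belief_on_A({"A": 1}))  # strategy prescribes A
print(belief_on_A({"B": 1}))  # strategy prescribes B
print(belief_on_A({"C": 1}))  # strategy prescribes C: belief undetermined
```

The same function also covers mixed behavior at the root, since only the total probabilities of A and of B enter the formula.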

Remark  4. 1  We  do  not  consider  mixed  strategies  in  this  chapter,  but  suppose  we 
did  and  consider,  for  instance,  the  mixed  strategy  where  player  1  plays  Aa  with 
probability  |,  Ab  with  probability  and  Ba ,  Bb ,  Ca  and  Cb  each  with  probability 

Y2 .  Then  we  would  have  a  =  +  \  +  2  •  =  |.  In  words,  given  player 

l’s  strategy,  if  it  turns  out  that  player  1  has  played  A  or  B ,  then  player  2  should 
believe  that  player  1  has  played  A  with  probability  |  and  B  with  probability  | .  □ 
To complete the analysis of this game, note that it is always optimal for player 1 to
play C, given the actions R and a following the chance move: C yields 8 to player
1 whereas A or B yield at most 7. But if player 1 does not play A or B, then, as we
have just seen, Bayesian consistency [condition (i)] does not put any restriction on
the belief $\alpha$ of player 2, since the nontrivial information set of player 2 is reached
with zero probability. This means that $\alpha$ is free, but given $\alpha$ player 2 should choose
optimally, as computed before. Hence, we have essentially two perfect Bayesian
equilibria, namely (Ca, lR) with beliefs $\alpha \le 3/4$ and (Ca, rR) with beliefs $\alpha \ge 3/4$.
Note that these are also the subgame perfect equilibria, now 'backed up' by a belief
of player 2 on his nontrivial information set.

Remark  4.2  As  already  indicated,  a  perfect  Bayesian  equilibrium  is  also  a  subgame 
perfect  equilibrium,  but  in  order  to  show  this  a  more  detailed  formal  definition  is 
required.  See  Chap.  14.  □ 

In order to show that the perfect Bayesian equilibrium requirement can have an
additional impact compared to subgame perfection, consider the variation on the
game of Fig. 4.1, obtained by replacing the payoffs (4,1) after A and l by the payoffs
(4,3), resulting in the game in Fig. 4.9.

One can check that the subgame perfect equilibria are still (Ca, lR) and (Ca, rR).
Obviously, a rational player 2 would never play r at his nontrivial information set
since l is always better, but subgame perfection does not rule this out. But clearly,
there is no belief that player 2 could have at this information set that would make
r optimal: if we denote player 2's belief by $(\alpha, 1-\alpha)$ as before, then r yields $2\alpha$
whereas l yields $3\alpha + 3(1-\alpha) = 3$, which is always larger than $2\alpha$. Hence, the
only perfect Bayesian equilibrium is (Ca, lR), with arbitrary free belief of player 2
at his nontrivial information set.
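
The claim that no belief makes r optimal in the modified game can be illustrated by a quick numerical check over a grid of beliefs. This is a sketch only: the grid of 101 beliefs and the function names are choices made here, and the algebra in the text, not the grid, is what establishes the claim for every belief.

```python
from fractions import Fraction


def payoff_l(alpha):
    """Player 2's expected payoff from l in the game of Fig. 4.9."""
    return 3 * alpha + 3 * (1 - alpha)


def payoff_r(alpha):
    """Player 2's expected payoff from r in the game of Fig. 4.9."""
    return 2 * alpha + 0 * (1 - alpha)


def r_is_ever_optimal(steps=100):
    """Check beliefs 0, 1/steps, ..., 1 for one at which r is at least as good as l."""
    grid = [Fraction(k, steps) for k in range(steps + 1)]
    return any(payoff_r(a) >= payoff_l(a) for a in grid)


print(r_is_ever_optimal())
```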

Fig. 4.9 The extensive form game of Fig. 4.1 with payoffs (4,3) after A and l

Fig. 4.10 The three-player game of Fig. 4.7 with belief of player 3

We conclude with another example. Consider again the game of Fig. 4.7,
reproduced in Fig. 4.10 with belief $(\alpha, 1-\alpha)$ attached to the nodes in the information
set of player 3. There are two ways to find the perfect Bayesian equilibria of this
game. One can consider the subgame perfect equilibria and find appropriate beliefs.
Alternatively, one can start from scratch and apply a form of backward induction. To
illustrate the last method, start with player 3. If player 3 plays l then his (expected)
payoff is $1 \cdot \alpha + 0 \cdot (1-\alpha) = \alpha$. If player 3 plays r then his (expected) payoff is
$0 \cdot \alpha + 3(1-\alpha) = 3 - 3\alpha$. Therefore, l is optimal if $\alpha \ge 3/4$ and r is optimal if
$\alpha \le 3/4$. In fact, we have just applied condition (ii), sequential rationality, in the
definition of a perfect Bayesian equilibrium.

Now suppose player 3 plays l. Then it is optimal for player 2 to play L. If player
2 plays L, then condition (i) in the definition of a perfect Bayesian equilibrium,
Bayesian consistency, implies $\alpha = 1$: that is, player 3 should indeed believe that
player 2 has played L. Since $1 > 3/4$, l is the optimal action for player 3. Player 1,
finally, should play A, yielding payoff 3 instead of the payoff 2 resulting from B. So
we have a perfect Bayesian equilibrium (A, L, l) with belief $\alpha = 1$.

If player 3 plays r, then it is optimal for player 2 to play R, resulting in $\alpha = 0$
by Bayesian consistency; since $0 < 3/4$, r is the optimal action for player 3. In this
case, player 1 should play B: this yields payoff 2, whereas A yields only 1. Hence,
we have a perfect Bayesian equilibrium (B, R, r) with belief $\alpha = 0$.
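
The reasoning of this example can be replayed by brute force over pure strategies. The sketch below is not a general algorithm for perfect Bayesian equilibrium: it assumes the payoffs of Fig. 4.7, and, following the treatment above, it ties player 3's belief to player 2's pure strategy even when player 1 plays B. All names are hypothetical.

```python
from itertools import product

# Payoffs (player 1, player 2, player 3) after A, keyed by (action of 2, action of 3).
AFTER_A = {("L", "l"): (3, 3, 1), ("L", "r"): (0, 0, 0),
           ("R", "l"): (0, 0, 0), ("R", "r"): (1, 1, 3)}
AFTER_B = (2, 0, 0)


def payoff(s1, s2, s3):
    return AFTER_B if s1 == "B" else AFTER_A[(s2, s3)]


def value_player3(s3, alpha):
    """Expected payoff of player 3 when he believes L with probability alpha."""
    return alpha * AFTER_A[("L", s3)][2] + (1 - alpha) * AFTER_A[("R", s3)][2]


def is_pbe(s1, s2, s3, alpha):
    # (i) Bayesian consistency: the belief matches player 2's pure strategy.
    if alpha != (1 if s2 == "L" else 0):
        return False
    # (ii) Sequential rationality of player 3 given his belief.
    if any(value_player3(d, alpha) > value_player3(s3, alpha) for d in "lr"):
        return False
    # Sequential rationality of player 2 at his decision node.
    if any(AFTER_A[(d, s3)][1] > AFTER_A[(s2, s3)][1] for d in "LR"):
        return False
    # Sequential rationality of player 1 at the root.
    if any(payoff(d, s2, s3)[0] > payoff(s1, s2, s3)[0] for d in "AB"):
        return False
    return True


def pure_pbe():
    return [(s1, s2, s3, alpha)
            for s1, s2, s3 in product("AB", "LR", "lr")
            for alpha in (0, 1)
            if is_pbe(s1, s2, s3, alpha)]


print(pure_pbe())
```

Only the beliefs 0 and 1 need to be tried because consistency with a pure strategy of player 2 admits no other value.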


4.5  Problems 

4.1.  Counting  Strategies 

Consider  the  following  simplified  chess  game.  White  moves  first  (in  accordance 
with  the  usual  rules).  Black  observes  White’s  move  and  then  makes  its  move.  Then 
the  game  is  over  and  ends  in  a  draw.  Determine  the  strategy  sets  of  White  and  Black. 
How  many  strategies  does  Black  have?  [In  case  you  do  not  know  the  rules  of  chess: 
the  only  information  you  need  is  that  both  players  have  20  possible  actions.] 
Fig. 4.11 Entry deterrence, Problem 4.3 (figure not reproduced; surviving labels: Entrant; payoffs 40,50 and -10,0)

4.2.  Extensive  vs.  Strategic  Form 

Each  game  in  extensive  form  leads  to  a  unique  game  in  strategic  form.  The  converse, 
however,  is  not  true.  Consider  the  following  bimatrix  game: 

\[
\begin{pmatrix}
1,0 & 1,0 & 0,1 & 0,1 \\
2,0 & 0,2 & 2,0 & 0,2
\end{pmatrix}.
\]

Find  an  extensive  form  game  with  perfect  information  and  an  extensive  form  game 
with  imperfect  information,  both  having  this  bimatrix  game  as  their  strategic  form 
game. 

4.3.  Entry  Deterrence 

Consider  the  entry  deterrence  game  of  Chap.  1,  of  which  the  extensive  form  is 
reproduced  in  Fig.  4.11. 

(a)  Write  down  the  strategic  form  of  this  game. 

(b)  Determine  the  Nash  equilibria  (in  pure  strategies).  Which  one  is  the  backward 
induction  equilibrium?  Which  one  is  subgame  perfect?  In  which  sense  is  the 
other equilibrium based on an 'incredible threat'?

4.4.  Choosing  Objects 

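Four objects $O_1$, $O_2$, $O_3$, and $O_4$ have different worths for two players 1 and 2, given
by the following table:

\[
\begin{array}{l|cccc}
 & O_1 & O_2 & O_3 & O_4 \\ \hline
\text{Worth for player 1} & 1 & 2 & 3 & 4 \\
\text{Worth for player 2} & 2 & 3 & 4 & 1
\end{array}
\]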

Player  1  starts  with  choosing  an  object.  After  him  player  2  chooses  an  object,  then 
player  1  takes  his  second  object,  and  finally  player  2  gets  the  object  that  is  left.  The 
payoff  for  a  player  is  the  sum  of  the  worths  of  the  objects  he  obtains. 

(a)  Draw  the  decision  tree  for  this  extensive  form  game. 

(b)  How  many  strategies  does  each  player  have? 
(c) Determine the backward induction or subgame perfect equilibria (in pure
strategies).  How  many  different  subgame  perfect  equilibria  are  there?  What  are 
the  associated  outcomes,  and  the  resulting  divisions  of  the  objects? 

(d)  Is  there  a  Nash  equilibrium  in  this  game  resulting  in  a  division  of  the  objects 
different  from  the  division  in  a  subgame  perfect  equilibrium? 

4.5.  A  Bidding  Game 

Players  1  and  2  bid  for  an  object  that  has  value  2  for  each  of  them.  They  both  have 
wealth  3  and  are  not  allowed  to  bid  higher  than  this  amount.  Each  bid  must  be  a 
nonnegative  integer  amount.  Besides  bidding,  each  player,  when  it  is  his  turn,  has 
the  options  to  pass  (P)  or  to  match  (M)  the  last  bid,  where  the  last  bid  is  set  at  zero 
at  the  beginning  of  the  game.  If  a  player  passes  (P),  then  the  game  is  over  and  the 
other  player  gets  the  object  and  pays  the  last  bid.  If  a  player  matches  (M),  then  the 
game is over and each player gets the object and pays the last bid with probability 1/2.
Player  1  starts,  and  the  players  alternate  until  the  game  is  over.  Each  new  bid  must 
be  higher  than  the  last  bid. 

(a)  Draw  the  game  tree  of  this  extensive  form  game. 

(b)  How  many  strategies  does  player  1  have?  Player  2? 

(c)  How  many  subgame  perfect  equilibria  does  this  game  have?  What  is  (are)  the 
possible  subgame  perfect  equilibrium  outcome(s)? 

(d)  Describe  all  (pure  strategy)  Nash  equilibria  of  the  game  (do  not  make  the 
strategic  form).  Is  there  any  Nash  equilibrium  that  does  not  result  in  a  subgame 
perfect  equilibrium  outcome? 

4.6.  An  Extensive  Form  Game 

For  the  game  in  Fig.  4.12,  write  down  the  strategic  form  and  compute  all  Nash 
equilibria,  subgame  perfect  equilibria,  and  perfect  Bayesian  equilibria  in  pure 
strategies. 


Fig. 4.12 Extensive form game of Problem 4.6

Fig. 4.13 Extensive form game of Problem 4.7

Fig. 4.14 Extensive form game of Problem 4.8


4.7.  Another  Extensive  Form  Game 

For  the  game  in  Fig.  4.13,  write  down  the  strategic  form  and  compute  all  Nash 
equilibria,  subgame  perfect  equilibria,  and  perfect  Bayesian  equilibria  in  pure 
strategies. 


4.8.  Still  Another  Extensive  Form  Game 
Consider the extensive form game in Fig. 4.14.

Fig. 4.15 The centipede game of Problem 4.9 (figure not reproduced; surviving labels: six moves C; payoffs 8,6; 2,0; 1,3; 4,2; 3,5; 6,4; 5,7)


(a)  Determine  the  strategic  form  of  this  game. 

(b)  Determine  all  Nash  equilibria  in  pure  strategies. 

(c)  Determine  all  subgame  perfect  equilibria  in  pure  strategies. 

(d)  Determine  all  perfect  Bayesian  equilibria  in  pure  strategies. 

4.9.  A  Centipede  Game 

In the centipede game, the two players move alternately. On each move, a player
can  stop  (S)  or  continue  (C).  On  any  move,  a  player  is  better  off  stopping  the  game 
than  continuing  if  the  other  player  stops  immediately  afterward,  but  is  worse  off 
stopping  than  continuing  if  the  other  player  continues,  regardless  of  the  subsequent 
actions.  The  game  ends  after  a  finite  number  of  periods.  Consider  an  example  of  this 
game  in  Fig.  4.15. 

(a)  Determine  the  backward  induction  or  subgame  perfect  equilibrium  of  this  game. 
What  is  the  associated  outcome? 

(b)  Show  that  there  are  other  Nash  equilibria,  but  that  these  always  result  in  the 
same  outcome  as  the  subgame  perfect  equilibrium. 


4.10. Finitely Repeated Prisoners' Dilemma
Consider the prisoners' dilemma game of Chap. 1:

\[
\begin{array}{c|cc}
  & C & D \\ \hline
C & -1,-1 & -10,0 \\
D & 0,-10 & -9,-9
\end{array}
\]

Suppose  that  this  game  is  played  twice.  After  the  first  play  of  the  game  the  players 
learn  the  outcome  of  that  play.  The  final  payoff  for  each  player  is  the  sum  of  the 
payoffs  of  the  two  stages. 

(a)  Write  down  the  extensive  form  of  this  game.  How  many  strategies  does  each 
player  have?  How  many  subgames  does  this  game  have? 

(b)  Determine  the  subgame  perfect  equilibrium  or  equilibria  of  this  game.  What  if 
the  game  is  repeated  more  than  twice  but  still  finitely  many  times? 
4.11. A Twice Repeated 2 x 2 Bimatrix Game
Consider the following bimatrix game:

\[
\begin{array}{c|cc}
  & L & R \\ \hline
T & 2,1 & 1,0 \\
B & 5,2 & 4,4
\end{array}
\]

Suppose  that  the  game  is  played  twice,  and  that  after  the  first  play  of  the  game  the 
players  learn  the  outcome  of  that  play.  The  final  payoff  for  each  player  is  the  sum  of 
the  payoffs  of  the  two  stages. 

(a)  Determine  the  subgame  perfect  equilibrium  or  equilibria  of  this  game.  What  if 
the  game  is  repeated  more  than  twice  but  still  finitely  many  times? 

(b) Exhibit a Nash equilibrium (of the twice repeated game) where (B, L) is played
in  the  first  round. 

4.12.  Twice  Repeated  3x3  Bimatrix  Games 
Consider  the  following  bimatrix  game: 


\[
\begin{array}{c|ccc}
  & L & M & R \\ \hline
T & 8,8 & 0,9 & 0,0 \\
C & 9,0 & 0,0 & 3,1 \\
B & 0,0 & 1,3 & 3,3
\end{array}
\]

Suppose  that  the  game  is  played  twice,  and  that  after  the  first  play  of  the  game  the 
players  learn  the  outcome  of  that  play.  The  final  payoff  for  each  player  is  the  sum  of 
the  payoffs  of  the  two  stages. 

(a)  How  many  strategies  does  each  player  have  in  this  game?  How  many  subgames 
does  the  game  have? 

(b) Describe a subgame perfect equilibrium in which (T, L) is played in the first
round. 

For  question  (c),  consider  the  bimatrix  game 


\[
\begin{array}{c|ccc}
  & L & M & R \\ \hline
T & 5,3 & 0,0 & 2,0 \\
C & 0,0 & 2,2 & 0,0 \\
B & 0,0 & 0,0 & 0,0
\end{array}
\]

(c)  For  the  twice  repeated  version  of  this  game,  describe  a  subgame  perfect 
equilibrium  in  which  (B,  R)  is  played  in  the  first  round. 


4.6 Notes

The  concept  of  subgame  perfection  was  first  formally  introduced  by  Selten  (1965, 
1975).  The  result  that  each  finite  extensive  form  game  of  perfect  information  has 
a  backward  induction  equilibrium  in  pure  strategies  is  intuitive  but  nevertheless 
somewhat  cumbersome  to  prove,  see  for  instance  Perea  (2001,  Chap.  3). 

There  is  also  a  stronger  version  of  Bayesian  consistency,  resulting  in  the  concept 
of  ‘sequential  equilibrium’,  see  Kreps  and  Wilson  (1982)  and  Chap.  14. 

The  game  in  Problem  4.12(c)  is  taken  from  Benoit  and  Krishna  (1985). 
Finite Games with Incomplete Information


In  a  game  of  imperfect  information  players  may  be  uninformed  about  the  moves 
made  by  other  players.  Every  one-shot,  simultaneous  move  game  is  a  game 
of  imperfect  information.  In  a  game  of  incomplete  information  players  may  be 
uninformed  about  certain  characteristics  of  the  game  or  of  the  players.  For  instance, 
a  player  may  have  incomplete  information  about  actions  available  to  some  other 
player,  or  about  payoffs  of  other  players.  Incomplete  information  is  modelled  by 
assuming  that  every  player  can  be  of  a  number  of  different  types.  A  type  of  a  player 
summarizes  all  relevant  information  (in  particular,  actions  and  payoffs)  about  that 
player.  Furthermore,  it  is  assumed  that  each  player  knows  his  own  type  and,  given 
his  own  type,  has  a  probability  distribution  over  the  types  of  the  other  players.  Often, 
these  probability  distributions  are  assumed  to  be  consistent  in  the  sense  that  they 
are  the  marginal  probability  distributions  derived  from  a  basic  commonly  known 
distribution  over  all  combinations  of  player  types. 

In  this  chapter  we  consider  games  with  finitely  many  players,  finitely  many  types, 
and  finitely  many  strategies.  These  games  can  be  either  static  (simultaneous,  one- 
shot)  or  dynamic  (extensive  form  games).  A  Nash  equilibrium  in  this  context  is 
also  called  ‘Bayesian  equilibrium’,  and  in  games  in  extensive  form  an  appropriate 
refinement  is  perfect  Bayesian  equilibrium.  As  will  become  clear,  in  essence  the 
concepts  studied  in  Chaps.  3  and  4  are  applied  again.  Throughout  this  chapter  we 
restrict  attention  to  pure  strategies  and  pure  strategy  Nash  equilibria. 

In  Sect.  5.1  we  present  a  brief  introduction  to  the  concept  of  player  types  in  a 
game,  but  the  remainder  of  the  chapter  can  also  be  understood  without  this  general 
introduction.  Section  5.2  considers  static  games  of  incomplete  information,  and 
Sect. 5.3 discusses so-called signaling games, which form the most widely applied class
of  extensive  form  games  with  incomplete  information.  Both  Sects.  5.2  and  5.3  are 
based  on  examples,  rather  than  general  definitions.  For  the  latter,  see  Chap.  14. 
5.1 Player Types

Consider the set of players $N = \{1, \ldots, n\}$. For each player $i \in N$, there is a finite
set $T_i$ of types which that player can have. If we denote by $T = T_1 \times \ldots \times T_n$ the set

\[
T = \{(t_1, \ldots, t_n) \mid t_1 \in T_1,\ t_2 \in T_2,\ \ldots,\ t_n \in T_n\},
\]

i.e., the set of all possible combinations of types, then a game with incomplete
information specifies a separate game for every possible combination $t = (t_1, \ldots, t_n) \in T$,
in a way to be explained in the next sections. We assume that each player $i$ knows
his own type $t_i$ and, given $t_i$, attaches probabilities

\[
p(t_1, \ldots, t_{i-1}, t_{i+1}, \ldots, t_n \mid t_i)
\]

to all type combinations $t_1 \in T_1, \ldots, t_{i-1} \in T_{i-1}, t_{i+1} \in T_{i+1}, \ldots, t_n \in T_n$ of the
other players.

Often, these probabilities are derived from a common probability distribution $p$
over $T$, where $p(t)$ is the probability that the type combination is $t$. This is also what
we assume in this chapter. Moreover, we assume that every player $i$, apart from
his own type, also knows the probability distribution $p$. Hence, if player $i$ has
type $t_i$, then he can compute the probability that the type combination of the other
players is the vector $(t_1, \ldots, t_{i-1}, t_{i+1}, \ldots, t_n)$. Formally, this probability is equal to
the conditional probability

\[
p(t_1, \ldots, t_{i-1}, t_{i+1}, \ldots, t_n \mid t_i)
= \frac{p(t_1, \ldots, t_{i-1}, t_i, t_{i+1}, \ldots, t_n)}
       {\sum p(t'_1, \ldots, t'_{i-1}, t_i, t'_{i+1}, \ldots, t'_n)}
\]

where the sum in the denominator is taken over all possible types of the other
players, i.e., over all possible $t'_1 \in T_1, \ldots, t'_{i-1} \in T_{i-1}, t'_{i+1} \in T_{i+1}, \ldots, t'_n \in T_n$.
Hence, the sum in the denominator is the probability that player $i$ has type $t_i$.
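
To make the conditional probability concrete, here is a minimal sketch that derives a player's beliefs from a common distribution over type combinations. The representation of the distribution as a dictionary, the function name `conditional_beliefs`, and the two-player numbers in the usage example are all illustrative assumptions, not taken from the text.

```python
from fractions import Fraction


def conditional_beliefs(prior, i, t_i):
    """Beliefs of player i (0-based index) with type t_i about the other players' types.

    prior maps type combinations (t_1, ..., t_n), given as tuples, to probabilities.
    The result maps the remaining coordinates (player i's entry removed) to
    conditional probabilities.
    """
    # Denominator: the probability that player i has type t_i.
    marginal = sum(p for t, p in prior.items() if t[i] == t_i)
    if marginal == 0:
        raise ValueError("type t_i has probability zero under the prior")
    return {t[:i] + t[i + 1:]: p / marginal
            for t, p in prior.items() if t[i] == t_i}


# Hypothetical prior: player 1 has types a, b and player 2 has types x, y.
prior = {
    ("a", "x"): Fraction(1, 2),
    ("a", "y"): Fraction(1, 4),
    ("b", "x"): Fraction(0),
    ("b", "y"): Fraction(1, 4),
}
print(conditional_beliefs(prior, 0, "a"))
print(conditional_beliefs(prior, 1, "y"))
```

In this hypothetical example, type a of player 1 assigns probability 2/3 to x and 1/3 to y, since the denominator equals 3/4.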

Thus,  a  player  in  a  game  of  incomplete  information  can  make  his  actions 
dependent  on  his  own  type  but  not  on  the  types  of  the  other  players.  However,  since 
he  knows  the  probabilities  of  the  other  players’  types,  he  can  compute  the  expected 
payoffs  from  taking  specific  actions.  In  the  next  two  sections  we  will  see  how  this 
works  in  static  and  in  extensive  form  games. 


5.2  Static  Games  of  Incomplete  Information 


We  discuss  a  few  examples. 


5.2.1 Battle-of-the-Sexes with One-Sided Incomplete Information

The first example is a variant of the Battle-of-the-Sexes (see Sect. 1.3.2) in which
player 1 (the man) does not know whether player 2 (the woman) wants to go out
with him or avoid him. More precisely, player 1 does not know whether he plays
the game y (from 'yes') or the game n (from 'no'), where these games are as
follows:

\[
y:\quad
\begin{array}{c|cc}
  & F & B \\ \hline
F & 2,1 & 0,0 \\
B & 0,0 & 1,2
\end{array}
\qquad
n:\quad
\begin{array}{c|cc}
  & F & B \\ \hline
F & 2,0 & 0,2 \\
B & 0,1 & 1,0
\end{array}
\]

Player 1 attaches probability 1/2 to each of these games, and player 2 knows this.
In the terminology of types, this means that player 1 has only one type, simply
indicated by '1', and that player 2 has two types, namely y and n. So there are two
type combinations, namely (1, y) and (1, n), each occurring with probability 1/2.
Player 2 knows player 1's type with certainty, and also knows her own type, that is,
she knows which game is actually being played. Player 1 attaches probability 1/2
to each type of player 2.

What  would  be  a  Nash  equilibrium  in  this  game?  To  see  this,  it  is  helpful  to  model 
the  game  as  a  game  in  extensive  form,  using  the  tree  representation  of  Chap.  4.  Such 
a  tree  is  drawn  in  Fig.  5.1. 

The  game  starts  with  a  chance  move  which  selects  which  of  the  two  bimatrix 
games  is  going  to  be  played.  In  the  terminology  of  types,  it  selects  the  type  of  player 
2.  Player  2  is  informed  but  player  1  is  not.  Player  2  has  four  different  strategies  but 
player  1  only  two. 

From  this  extensive  form  it  is  apparent  that  every  Nash  equilibrium  is  subgame 
perfect,  since  there  are  no  nontrivial  subgames. 

Fig. 5.1 An extensive form representation of the Battle-of-the-Sexes game with incomplete
information. The upper numbers are the payoffs for player 1

Also, every Nash equilibrium is perfect Bayesian, since the only nontrivial
information set (namely, that of player 1) is reached with positive probability
(namely, equal to 1) for any strategy of player 2, and thus the beliefs are completely
determined by player 2's strategy through Bayesian updating (see Sect. 4.4). More
precisely, suppose the belief of player 1 is denoted by the nonnegative vector
$(\alpha_1, \alpha_2, \alpha_3, \alpha_4)$, where $\alpha_1 + \ldots + \alpha_4 = 1$, from left to right in Fig. 5.1. For instance,
$\alpha_3$ is the probability attached by player 1 to player 2 having type n and playing
strategy F. Suppose, for instance, that player 2 plays F if she has type y and B if
she has type n, and let E denote the event that player 1's information set is reached.
Then

\[
\begin{aligned}
\alpha_1 &= \mathrm{Prob}[\text{2 plays } F \text{ and has type } y \mid E] \\
&= \frac{\mathrm{Prob}[\text{2 plays } F \text{, has type } y \text{, and } E]}{\mathrm{Prob}[E]} \\
&= \frac{\mathrm{Prob}[\text{2 plays } F \mid \text{2 has type } y] \cdot \mathrm{Prob}[\text{2 has type } y] \cdot \mathrm{Prob}[E]}{\mathrm{Prob}[E]} \\
&= \frac{1 \cdot 0.5 \cdot 1}{1} = 0.5 .
\end{aligned}
\]

By a similar computation we find $\alpha_2 = 0$, $\alpha_3 = 0$, and $\alpha_4 = 0.5$. Such computations
can also be made if player 2 plays a mixed strategy, but we restrict attention here to
pure strategies. What matters is that the beliefs of player 1 are always determined by
computing the conditional probabilities, since his (only) information set is always
reached with positive probability, namely 1. Hence, there are no free beliefs, in the
terminology of Sect. 4.4.
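The updating step can be sketched in a few lines of Python. This is only an illustration: the function name `beliefs` and the dictionary encoding of a strategy are choices made here, and the node order (y,F), (y,B), (n,F), (n,B) is the one implied by the description of the first and third belief components in the text.

```python
from fractions import Fraction

# Prior over the types of player 2, as stated in the text (1/2 each).
PRIOR = {"y": Fraction(1, 2), "n": Fraction(1, 2)}

# Nodes of player 1's information set, from left to right.
NODES = [("y", "F"), ("y", "B"), ("n", "F"), ("n", "B")]

def beliefs(strategy):
    """Return the belief vector of player 1 for a pure strategy of player 2.

    `strategy` maps each type of player 2 to an action,
    e.g. {"y": "F", "n": "B"} for the strategy FB.
    """
    # Probability of reaching each node: the prior of the type if that type
    # plays the node's action, and zero otherwise.
    joint = [PRIOR[t] if strategy[t] == a else Fraction(0) for t, a in NODES]
    reach = sum(joint)  # Prob[E]; equals 1 for every pure strategy of player 2
    return [p / reach for p in joint]

print(beliefs({"y": "F", "n": "B"}))
```

Because the information set is reached with probability 1 whatever player 2 does, the division never fails, which is exactly why no belief is free here.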

The strategic form of the game is given in Fig. 5.2. There, the first letter in a
strategy of player 2 says what player 2 plays if y is chosen by the Chance move, and
the second letter says what player 2 plays if n is chosen. For instance, if player 1
plays F and player 2 plays FF, then the expected payoffs are equal to 0.5 · (2, 1) +
0.5 · (2, 0) = (2, 0.5). Or if player 1 plays B and player 2 plays FB, the expected
payoffs are equal to 0.5 · (0, 0) + 0.5 · (1, 0) = (0.5, 0), etc. Also the best replies are
indicated.

From  the  strategic  form  it  is  apparent  that  the  game  has  a  unique  Nash 
equilibrium  in  pure  strategies,  namely  ( F,FB ).  In  this  equilibrium  player  1  plays 
F,  type  y  of  player  2  plays  F  and  type  n  of  player  2  plays  B. 
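The strategic form and its pure Nash equilibria can be recomputed with a short script. This is a sketch under an assumption: only the payoff pairs (2,1), (2,0), (0,0) and (1,0) are quoted in the text, so the remaining entries of the two bimatrix games below are filled in with the usual Battle-of-the-Sexes values and should be checked against the games defined earlier in the section. The names `GAMES`, `expected` and `pure_nash` are introduced here for illustration.

```python
from itertools import product

# Payoffs (player 1, player 2) for each type of player 2; keys are
# (action of player 1, action of player 2). Entries not quoted in the
# text are ASSUMED (standard Battle-of-the-Sexes values).
GAMES = {
    "y": {("F", "F"): (2, 1), ("F", "B"): (0, 0),
          ("B", "F"): (0, 0), ("B", "B"): (1, 2)},
    "n": {("F", "F"): (2, 0), ("F", "B"): (0, 2),
          ("B", "F"): (0, 1), ("B", "B"): (1, 0)},
}
PRIOR = {"y": 0.5, "n": 0.5}

S1 = ["F", "B"]
S2 = ["".join(p) for p in product("FB", repeat=2)]  # FF, FB, BF, BB

def expected(a1, s2):
    """Expected payoffs when player 1 plays a1 and player 2 plays s2, e.g. 'FB'."""
    acts = {"y": s2[0], "n": s2[1]}  # first letter: type y, second: type n
    u1 = sum(PRIOR[t] * GAMES[t][(a1, acts[t])][0] for t in PRIOR)
    u2 = sum(PRIOR[t] * GAMES[t][(a1, acts[t])][1] for t in PRIOR)
    return u1, u2

def pure_nash():
    """All strategy pairs in which both strategies are mutual best replies."""
    equilibria = []
    for a1, s2 in product(S1, S2):
        u1, u2 = expected(a1, s2)
        best1 = all(u1 >= expected(b, s2)[0] for b in S1)
        best2 = all(u2 >= expected(a1, r)[1] for r in S2)
        if best1 and best2:
            equilibria.append((a1, s2))
    return equilibria

print(expected("F", "FF"))  # the first worked example in the text
print(pure_nash())
```

With these payoffs the enumeration returns the single pair (F, FB), in line with the strategic form.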

Another equilibrium concept, appropriate for static games with incomplete
information, is that of a Bayesian equilibrium. In a Bayesian equilibrium, each
type of each player plays a best reply against the other players. That is, each
type of a player plays an action that maximizes his expected payoff, where the
expectation is taken over the type combinations of the other players and their
actions. The Nash equilibrium (F, FB) in the game above is such a Bayesian
equilibrium: the action F of type y of player 2 is a best reply against F of
player 1 (player 1 has only one type), the action B of type n of player 2 is
a best reply against F of player 1, and action F of player 1 maximizes player
1's expected payoff against the strategy FB of player 2. In fact, if every type
of every player has positive probability (as will be the case throughout this
chapter), then the Nash equilibria of the strategic form coincide with the Bayesian
equilibria.

The (pure) Nash equilibrium or equilibria of a game like this can also be found
without drawing the extensive form and computing the strategic form, as follows.
Suppose first that player 1 plays F in an equilibrium. Then the best reply of player
2 is to play F if her type is y and B if her type is n. The expected payoff to player 1
is then 1; playing B against this strategy FB of player 2 yields only 0.5. So (F, FB)
is a Nash equilibrium. If, on the other hand, player 1 plays B, then the best reply
of player 2 is B if her type is y and F if her type is n. This yields a payoff of
0.5 to player 1, whereas playing F against this strategy BF of player 2 yields payoff
1. Hence, there is no equilibrium where player 1 plays B. Of course, this is also
apparent from the strategic form, but the argument can be made without complete
computation of the strategic form.
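The four expected payoffs of player 1 used in this argument can be written out explicitly. The following is a worked sketch, assuming that player 1 receives 2 at (F, F), 1 at (B, B) and 0 otherwise in both bimatrix games, which agrees with the payoff vectors quoted above; the symbol $u_1$ is shorthand introduced here for player 1's expected payoff.

```latex
\begin{aligned}
u_1(F, FB) &= 0.5 \cdot 2 + 0.5 \cdot 0 = 1,   & u_1(B, FB) &= 0.5 \cdot 0 + 0.5 \cdot 1 = 0.5, \\
u_1(B, BF) &= 0.5 \cdot 1 + 0.5 \cdot 0 = 0.5, & u_1(F, BF) &= 0.5 \cdot 0 + 0.5 \cdot 2 = 1.
\end{aligned}
```

In each line the first term is the contribution of type y and the second that of type n. The top row shows that F is a best reply against FB, and the bottom row shows that B is not a best reply against BF.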


5.2.2  Battle-of-the-Sexes  with  Two-Sided  Incomplete  Information 


The  next  example  is  a  further  variation  on  the  Battle-of-the-Sexes  game  in  which 
neither  player  knows  whether  the  other  player  wants  to  be  together  with  him/her 
or  not.  It  is  based  on  the  four  bimatrix  games  in  Fig.  5.3.  These  four  bimatrix 
games  correspond  to  the  four  possible  type  combinations  of  players  1  and  2.  The 
probabilities  of  these  four  different  combinations  are  given  in  Table  5.1.  One  way 
to  find  the  Nash  equilibria  of  this  game  is  to  draw  the  extensive  form  and  compute 
the  associated  strategic  form:  see  Problem  5.1.  Alternatively,  we  can  systematically 
examine  the  possible  strategy  pairs,  as  follows. 

First observe that each player now has four strategies, namely FF, FB, BF, and
BB, where the first letter is the action taken by the yes-type ($y_1$ or $y_2$), and the second
letter is the action taken by the no-type ($n_1$ or $n_2$).


Fig. 5.3 Payoffs for Battle-of-the-Sexes with two types per player

Next, the conditional type probabilities can easily be computed from Table 5.1.
For instance,

\[
p(y_2 \mid y_1) = \frac{p(y_1 y_2)}{p(y_1)} = \frac{p(y_1 y_2)}{p(y_1 y_2) + p(y_1 n_2)} = \frac{2/6}{(2/6) + (2/6)} = \frac{1}{2} .
\]

The other conditional probabilities are computed in the same way, yielding:

\[
p(n_2 \mid y_1) = 1/2, \quad p(y_2 \mid n_1) = 1/2, \quad p(n_2 \mid n_1) = 1/2 ,
\]

\[
p(y_1 \mid y_2) = 2/3, \quad p(n_1 \mid y_2) = 1/3, \quad p(y_1 \mid n_2) = 2/3, \quad p(n_1 \mid n_2) = 1/3 .
\]
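These conditional probabilities can be checked mechanically. The sketch below assumes the joint distribution 2/6, 2/6, 1/6, 1/6 over the type combinations; the first two values appear in the computation above, the last two are inferred from the conditional probabilities listed, since Table 5.1 itself is not reproduced here. The function names are hypothetical.

```python
from fractions import Fraction

# Joint distribution over (type of player 1, type of player 2).
# The entries for n1 are INFERRED from the conditional probabilities
# in the text, not copied from Table 5.1.
JOINT = {
    ("y1", "y2"): Fraction(2, 6), ("y1", "n2"): Fraction(2, 6),
    ("n1", "y2"): Fraction(1, 6), ("n1", "n2"): Fraction(1, 6),
}

def p2_given_1(t2, t1):
    """p(t2 | t1): belief of type t1 of player 1 about player 2's type."""
    marginal = sum(p for (a, _), p in JOINT.items() if a == t1)
    return JOINT[(t1, t2)] / marginal

def p1_given_2(t1, t2):
    """p(t1 | t2): belief of type t2 of player 2 about player 1's type."""
    marginal = sum(p for (_, b), p in JOINT.items() if b == t2)
    return JOINT[(t1, t2)] / marginal

print(p2_given_1("y2", "y1"), p1_given_2("y1", "y2"))
```

Exact fractions are used so that later comparisons of expected payoffs, where ties matter, are not affected by rounding.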


We  now  consider  the  four  pure  strategies  of  player  1  one  by  one. 


(i) Suppose that player 1 plays the strategy FF, meaning that he plays F (the first
letter) if his type is $y_1$ and also F (the second letter) if his type is $n_1$. Then the
expected payoff for type $y_2$ of player 2 if she plays F is

\[ p(y_1 \mid y_2) \cdot 1 + p(n_1 \mid y_2) \cdot 1 = (2/3) \cdot 1 + (1/3) \cdot 1 = 1 . \]

If type $y_2$ of player 2 plays B her expected payoff is

\[ p(y_1 \mid y_2) \cdot 0 + p(n_1 \mid y_2) \cdot 0 = (2/3) \cdot 0 + (1/3) \cdot 0 = 0 . \]

Hence the best reply of type $y_2$ of player 2 is F. Similarly, for type $n_2$ of player
2, playing F yields

\[ p(y_1 \mid n_2) \cdot 0 + p(n_1 \mid n_2) \cdot 0 = (2/3) \cdot 0 + (1/3) \cdot 0 = 0 \]

and playing B yields

\[ p(y_1 \mid n_2) \cdot 2 + p(n_1 \mid n_2) \cdot 2 = (2/3) \cdot 2 + (1/3) \cdot 2 = 2 , \]

so that B is the best reply. Hence, player 2's best reply against FF is FB.
Suppose, now, that player 2 plays FB, so type $y_2$ plays F and type $n_2$ plays
B. Then playing F yields type $y_1$ of player 1 an expected payoff of

\[ p(y_2 \mid y_1) \cdot 2 + p(n_2 \mid y_1) \cdot 0 = (1/2) \cdot 2 + (1/2) \cdot 0 = 1 \]

and playing B yields

\[ p(y_2 \mid y_1) \cdot 0 + p(n_2 \mid y_1) \cdot 1 = (1/2) \cdot 0 + (1/2) \cdot 1 = 1/2 , \]

so that F is the best reply for type $y_1$ of player 1. Similarly, for type $n_1$ playing
F yields

\[ p(y_2 \mid n_1) \cdot 0 + p(n_2 \mid n_1) \cdot 2 = (1/2) \cdot 0 + (1/2) \cdot 2 = 1 \]

whereas playing B yields

\[ p(y_2 \mid n_1) \cdot 1 + p(n_2 \mid n_1) \cdot 0 = (1/2) \cdot 1 + (1/2) \cdot 0 = 1/2 . \]

Hence, F is the best reply for type $n_1$, so that player 1's best reply against FB
is FF. We conclude that (FF, FB) is a Nash equilibrium.

(ii) Suppose player 1 plays FB. Playing F yields type $y_2$ of player 2 a payoff of

\[ p(y_1 \mid y_2) \cdot 1 + p(n_1 \mid y_2) \cdot 0 = (2/3) \cdot 1 + (1/3) \cdot 0 = 2/3 \]

and playing B yields

\[ p(y_1 \mid y_2) \cdot 0 + p(n_1 \mid y_2) \cdot 2 = (2/3) \cdot 0 + (1/3) \cdot 2 = 2/3 , \]

so that both F and B are best replies. Playing F yields type $n_2$ a payoff of

\[ p(y_1 \mid n_2) \cdot 0 + p(n_1 \mid n_2) \cdot 1 = (2/3) \cdot 0 + (1/3) \cdot 1 = 1/3 \]

and playing B yields

\[ p(y_1 \mid n_2) \cdot 2 + p(n_1 \mid n_2) \cdot 0 = (2/3) \cdot 2 + (1/3) \cdot 0 = 4/3 , \]

so that B is the best reply. Hence, player 2 has two best replies, namely FB and
BB. Against FB player 1's best reply is FF [as established in case (i)] and not
FB, so this does not result in a Nash equilibrium. Against BB one can compute
in the same way as before that player 1's best reply is BF and not FB, so this
combination is not a Nash equilibrium either.

(iii) Suppose that player 1 plays BF. Then player 2 has two best replies, namely
BF and BB. Against BF the best reply of player 1 is FF and not BF, so this
combination is not a Nash equilibrium. Against BB, player 1's best reply is BF,
so the combination (BF, BB) is a Nash equilibrium.

(iv) Finally, suppose player 1 plays BB. Then player 2's best reply is BF. Against
this, player 1's best reply is FF and not BB. So BB of player 1 is not part of a
Nash equilibrium.

We conclude that the game has two Nash equilibria in pure strategies, namely: (i)
both types of player 1 play F, type $y_2$ of player 2 also plays F but type $n_2$ of player
2 plays B; (ii) type $y_1$ of player 1 plays B, type $n_1$ plays F, and both types of player
2 play B. Again, these equilibria are also called Bayesian Nash equilibria.
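The case-by-case search above can be cross-checked by brute force over all 16 strategy pairs. The sketch below rests on two assumptions: the joint type distribution is the one inferred from the conditional probabilities in the text, and each player's payoff depends only on his or her own type, with the values read off from the expected-payoff computations in cases (i) and (ii). Fig. 5.3 is not reproduced here, so these payoffs should be checked against it. All names are introduced for illustration.

```python
from fractions import Fraction
from itertools import product

T1, T2, ACTIONS = ("y1", "n1"), ("y2", "n2"), ("F", "B")

# ASSUMED joint distribution (inferred from the conditionals in the text).
JOINT = {("y1", "y2"): Fraction(2, 6), ("y1", "n2"): Fraction(2, 6),
         ("n1", "y2"): Fraction(1, 6), ("n1", "n2"): Fraction(1, 6)}

# ASSUMED payoffs, keyed by (action of player 1, action of player 2);
# missing keys mean payoff 0. Yes-types want to meet, no-types to avoid.
U1 = {"y1": {("F", "F"): 2, ("B", "B"): 1},
      "n1": {("F", "B"): 2, ("B", "F"): 1}}
U2 = {"y2": {("F", "F"): 1, ("B", "B"): 2},
      "n2": {("F", "B"): 2, ("B", "F"): 1}}

def value1(t1, a1, s2):
    """Payoff of type t1 from a1 against s2, weighted by joint probabilities.

    Dividing by p(t1) would give the conditional expectation; the positive
    factor does not change which action is best.
    """
    return sum(JOINT[(t1, t2)] * U1[t1].get((a1, s2[t2]), 0) for t2 in T2)

def value2(t2, a2, s1):
    """Same for type t2 of player 2 playing a2 against s1."""
    return sum(JOINT[(t1, t2)] * U2[t2].get((s1[t1], a2), 0) for t1 in T1)

def is_equilibrium(s1, s2):
    """True if every type of every player plays a best reply."""
    ok1 = all(value1(t, s1[t], s2) >= value1(t, a, s2)
              for t in T1 for a in ACTIONS)
    ok2 = all(value2(t, s2[t], s1) >= value2(t, a, s1)
              for t in T2 for a in ACTIONS)
    return ok1 and ok2

def pure_equilibria():
    found = []
    for a, b, c, d in product(ACTIONS, repeat=4):
        s1, s2 = {"y1": a, "n1": b}, {"y2": c, "n2": d}
        if is_equilibrium(s1, s2):
            found.append((a + b, c + d))
    return found

print(pure_equilibria())
```

Under these assumptions the search returns exactly the pairs (FF, FB) and (BF, BB). Checking best replies type by type is the Bayesian equilibrium condition, which coincides with Nash equilibrium of the strategic form because every type has positive probability.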
5.3  Signaling  Games

The  extensive  form  can  be  used  to  examine  a  static  game  of  incomplete  information, 
usually  by  letting  the  game  start  with  a  chance  move  that  picks  the  types  of  the 
players  (see  Sect.  5.2).  More  generally,  the  extensive  form  can  be  used  to  describe 
incomplete  information  games  where  players  move  sequentially.  An  important  class 
of  such  games  is  the  class  of  signaling  games. 

A (finite) signaling game starts with a chance move that picks the type of player
1. Player 1 is informed about his type but player 2 is not. Player 1 moves first, player
2 observes player 1's action and moves next, and then the game ends. Such a game
is called a signaling game because the action of player 1 may be a signal about his
type: that is, from the action of player 1 player 2 may be able to infer something
about the type of player 1.

5.3.1  An  Example 

Consider the example in Fig. 5.4. (The numbers between square brackets at player
2's decision nodes are the beliefs of player 2, which are used in a perfect Bayesian
equilibrium below.) In this game, player 1 learns the result of the chance move but
player 2 does not. In the terminology of Sect. 5.1, there are two type combinations,
namely $(t, 2)$ and $(t', 2)$, each one occurring with probability 1/2: these notations
express the fact that player 2 has only one type (called '2'). Both types of player 1
can choose between L and R. Player 2 only observes the action, L or R, and not the
type of player 1. For this reason we use the same letter L for the 'left' action of each
type: player 2 cannot distinguish between them. Similarly for the 'right' action.

In order to analyze this game and find the (pure strategy) Nash equilibria, one
possibility is to first compute the strategic form. Both players have four strategies.
Player 1 has strategy set

\[ \{LL, LR, RL, RR\} , \]

where the first letter refers to the action of type $t$ and the second letter to the action
of type $t'$. Player 2 has strategy set

\[ \{uu', ud', du', dd'\} . \]

The (expected) strategic form of the game can be computed in the usual way and
is presented in Fig. 5.5. For instance, consider the strategy combination (LR, ud').
Then the expected payoffs are 1/2 · (2, 6) + 1/2 · (2, 4) = (2, 5), etc. The (pure) best
replies are underlined. This shows that the game has two Nash equilibria, namely
(RL, uu') and (LL, ud'). What else can be said about these equilibria? Observe
that the only subgame of the game is the entire game, so that both equilibria are
trivially subgame perfect. Are they also perfect Bayesian? That is, do they satisfy
the conditions of Bayesian consistency and sequential rationality (Sect. 4.4)?

Fig. 5.5 The strategic form

First consider the equilibrium (RL, uu'). Bayesian consistency requires

\[
\begin{aligned}
\alpha &= \mathrm{Prob}[\text{player 1 has type } t \mid \text{player 1 plays } L] \\
&= \frac{\mathrm{Prob}[\text{player 1 has type } t \text{ and plays } L]}{\mathrm{Prob}[\text{player 1 plays } L]} \\
&= \frac{1/2 \cdot 0}{1/2} = 0
\end{aligned}
\]

and, similarly,

\[
\begin{aligned}
\beta &= \mathrm{Prob}[\text{player 1 has type } t \mid \text{player 1 plays } R] \\
&= \frac{\mathrm{Prob}[\text{player 1 has type } t \text{ and plays } R]}{\mathrm{Prob}[\text{player 1 plays } R]} \\
&= \frac{1/2 \cdot 1}{1/2} = 1 .
\end{aligned}
\]

Given these beliefs, playing u yields a payoff of 8 to player 2 (at the left information
set), whereas playing d yields only 2, so u is optimal. Playing u' yields a payoff
of 2 to player 2 (at the right information set), whereas playing d' yields 0, so u' is
optimal. Hence, uu' is indeed the best reply of player 2, as we already knew from
the strategic form. Thus, the pair (RL, uu') is a perfect Bayesian equilibrium with
beliefs $\alpha = 0$ and $\beta = 1$.

Note  that,  in  this  case  and,  more  generally,  in  case  of  a  Nash  equilibrium  where 
each  information  set  of  player  2  is  reached  with  positive  probability  (i.e.,  every 
possible  action  of  player  1  is  played  by  some  type  of  player  1),  the  perfect  Bayesian 
equilibrium  requirement  does  not  add  anything  that  is  essential:  the  beliefs  of  player 
2  are  completely  determined  by  Bayesian  consistency,  and  sequential  rationality 
is  automatically  satisfied  for  these  beliefs,  given  that  we  already  have  a  Nash 
equilibrium. 

The  perfect  Bayesian  equilibrium  (RL,  uu')  is  called  separating :  it  separates  the 
two  types  of  player  1,  since  these  types  play  different  actions.  In  this  equilibrium, 
the  action  of  player  1  is  a  signal  for  his  type,  and  the  equilibrium  is  ‘information 
revealing’. 

Next, consider the Nash equilibrium (LL, ud'). In this case, by Bayesian consistency
we obtain

\[
\begin{aligned}
\alpha &= \mathrm{Prob}[\text{player 1 has type } t \mid \text{player 1 plays } L] \\
&= \frac{\mathrm{Prob}[\text{player 1 has type } t \text{ and plays } L]}{\mathrm{Prob}[\text{player 1 plays } L]} \\
&= \frac{1/2 \cdot 1}{1} = 1/2 .
\end{aligned}
\]

In words: since each type of player 1 plays L, the conditional probabilities of the two
decision nodes in the left information set of player 2 are both equal to 1/2. Given
$\alpha = 1/2$ it follows that u is optimal at player 2's left information set (in fact, in
this particular game u is optimal for any $\alpha$): we know this already from the strategic
form, where both uu' and ud' are best replies of player 2 against LL.

How about the belief $(\beta, 1 - \beta)$? Since player 1 always plays L, the right
information set of player 2 is reached with zero probability, and therefore we cannot
compute $\beta$ by the formula for conditional probability: Bayesian consistency has no
bite, the belief $(\beta, 1 - \beta)$ is free (in the terminology of Sect. 4.4). Formally,

\[
\beta = \mathrm{Prob}[\text{player 1 has type } t \mid \text{player 1 plays } R]
= \frac{\mathrm{Prob}[\text{player 1 has type } t \text{ and plays } R]}{\mathrm{Prob}[\text{player 1 plays } R]} ,
\]

but the probability in the denominator of the last expression is zero if player 1
plays LL. However, we still have the sequential rationality requirement: in order
for (LL, ud') to be a perfect Bayesian equilibrium the belief $(\beta, 1 - \beta)$ should be
such that player 2's action d' is optimal. Hence, the expected payoff to player 2
from playing d' should be at least as large as the expected payoff from playing u',
so $4(1 - \beta) \geq 2\beta$, which is equivalent to $\beta \leq 2/3$. Thus, (LL, ud') is a perfect
Bayesian equilibrium with beliefs $\alpha = 1/2$ and $\beta \leq 2/3$.
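The inequality behind this threshold can be spelled out as a worked step. It uses player 2's payoffs at the right information set: 2 from u' and 0 from d' against type $t$ (quoted above), 4 from d' against type $t'$ (from the payoff pair (2, 4)), and 0 from u' against type $t'$, the last value being inferred from the inequality itself rather than read from Fig. 5.4. The notation $E[\cdot]$ for player 2's expected payoff is introduced here.

```latex
\begin{aligned}
E[u'] &= \beta \cdot 2 + (1 - \beta) \cdot 0 = 2\beta, \\
E[d'] &= \beta \cdot 0 + (1 - \beta) \cdot 4 = 4(1 - \beta), \\
E[d'] \geq E[u'] &\iff 4 - 4\beta \geq 2\beta \iff \beta \leq 2/3 .
\end{aligned}
```

At $\beta = 2/3$ player 2 is indifferent, so d' is still optimal there.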

In  this  case,  and  more  generally,  in  cases  where  in  a  Nash  equilibrium  not  every 
information  set  of  player  2  is  reached  with  positive  probability,  i.e.,  there  is  some 
action  of  player  1  which  is  played  by  no  type  of  player  1,  the  perfect  Bayesian 
equilibrium  requirement  does  have  an  impact. 

The  equilibrium  (LL,  ud')  is  called  pooling ,  since  it  ‘pools’  the  two  types  of 
player  1:  both  types  play  the  same  action,  L  in  this  case.  In  this  equilibrium,  the 
action  of  player  1  does  not  reveal  any  information  about  his  type. 
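The difference between a determined and a free belief can be made concrete with a small sketch. The function `belief_on_t` and the dictionary encoding of strategies are hypothetical; the prior of 1/2 per type is the one stated for the chance move. The function returns `None` when Bayes' rule cannot be applied.

```python
from fractions import Fraction

# Prior over the types of player 1 (chance move, 1/2 each).
PRIOR = {"t": Fraction(1, 2), "t'": Fraction(1, 2)}

def belief_on_t(strategy, action):
    """Probability that player 1 has type t, given that `action` was observed.

    `strategy` maps each type to "L" or "R". Returns None when no type plays
    `action`: the information set is then reached with probability zero and
    Bayesian consistency does not pin down the belief (it is free).
    """
    reach = sum(p for typ, p in PRIOR.items() if strategy[typ] == action)
    if reach == 0:
        return None
    hit = PRIOR["t"] if strategy["t"] == action else Fraction(0)
    return hit / reach

separating = {"t": "R", "t'": "L"}  # the strategy RL
pooling = {"t": "L", "t'": "L"}     # the strategy LL

print(belief_on_t(separating, "L"), belief_on_t(separating, "R"))
print(belief_on_t(pooling, "L"), belief_on_t(pooling, "R"))
```

For the separating strategy both beliefs are determined (0 after L, 1 after R). For the pooling strategy only the belief after L is determined, and the belief after R must instead be restricted by sequential rationality.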


5.3.2  Computing  Perfect  Bayesian  Equilibria  in  the  Extensive  Form 

Perfect  Bayesian  equilibria  can  also  be  found  without  first  computing  the  strategic 
form.  We  consider  again  the  signaling  game  in  Fig.  5.4. 

First, assume that there is an equilibrium where player 1 plays LL. Then $\alpha = 1/2$
by Bayesian consistency, and player 2's optimal action at the left information set
(following L) is u. At the right information set, player 2's optimal action is u' if
$\beta \geq 2/3$ and d' if $\beta \leq 2/3$. If player 2 would play u' after R, then type $t$ of player 1
would improve by playing R instead of L, so this cannot be an equilibrium. If player
2 plays d' after R, then no type of player 1 would want to play R instead of L. We
have established that (LL, ud') with beliefs $\alpha = 1/2$ and $\beta \leq 2/3$ is a (pooling)
perfect Bayesian equilibrium.

Second, assume player 1 plays LR in equilibrium. Then player 2's beliefs are
given by $\alpha = 1$ and $\beta = 0$, and player 2's best reply is ud'. But then type $t'$ of player
1 would gain by playing L instead of R, so this cannot be an equilibrium.

Third, assume player 1 plays RL in equilibrium. Then $\alpha = 0$, $\beta = 1$, and player
2's best reply is uu'. Against uu', RL is player 1's best reply, so that (RL, uu') is a
(separating) perfect Bayesian equilibrium with beliefs $\alpha = 0$ and $\beta = 1$.

Fourth, suppose player 1 plays RR in equilibrium. Then $\beta = 1/2$ and player 2's
best reply after R is d'. After L, player 2's best reply is u for any value of $\alpha$. Against
ud', however, type $t$ of player 1 would gain by playing L instead of R. So RR is not
part of an equilibrium.

Of  course,  these  considerations  can  also  be  based  on  the  strategic  form,  but  we 
do  not  need  the  entire  strategic  form  to  find  the  perfect  Bayesian  equilibria. 


5.3.3  The  Intuitive  Criterion 

In a perfect Bayesian equilibrium, if an information set of player 2 is reached with
zero probability, then the belief of player 2 on that information set is free: the
only requirement is the sequential rationality requirement demanding that, given
this belief, player 2 should choose the optimal action. The question is whether such
a free belief is always plausible or reasonable. The so-called intuitive criterion (IC)
puts a restriction on the plausibility of free beliefs.

It works as follows. Consider a perfect Bayesian equilibrium in a signaling game
and suppose that there is an information set which is reached in the equilibrium
with zero probability. In other words, there is an action, say A, of player 1 which
is played by no type of player 1. Consider a type t of player 1. Suppose this type t
obtains payoff v in the equilibrium under consideration. Then consider the maximal
possible payoff for this type t attainable by playing A, say m. If m < v, then the
belief of player 2 on the information set following A should assign zero probability
to type t of player 1. The reason is indeed intuitive: type t could never possibly gain
by playing A instead of his equilibrium action, since at best he would obtain m from
doing so, but m is less than what t can get in equilibrium, namely v. Therefore, player
2 should not believe that type t would ever deviate to A. This comparison should
be made for every type of player 1. This way, we may obtain some restrictions
on the belief of player 2 at the information set following action A. If the original
belief of player 2 on this information set, corresponding to the perfect Bayesian
equilibrium under consideration, satisfies these restrictions, then we say that this
perfect Bayesian equilibrium survives the IC. However, it could happen that this way
all types of player 1 get assigned zero probability: in that case, the IC simply does
not apply, since the probabilities in a belief have to sum up to 1 [cf. Problem 5.9(a)].

Let us apply the IC to the perfect Bayesian equilibrium (LL, ud') with $\beta \leq 2/3$
in the game in Fig. 5.4. The equilibrium payoff to type $t$ of player 1 is equal to 2.
If type $t$ of player 1 deviates to R, then he could get maximally 4, namely if player
2 would play u' following R. It is important to notice that we consider the maximal
possible payoff after such a deviation; of course, the payoff in the equilibrium after
a deviation (in this case 0) can never be higher by the mere definition of Nash
equilibrium. Since $4 \geq 2$, type $t$ could have a reason to deviate, and so the IC
puts no restriction on the belief of player 2 that player 1, if he would deviate to R,
is of type $t$. The equilibrium payoff to type $t'$ of player 1 is 4. Type $t'$, however, could
get at most 2 (in fact, would always get 2 in this game) by deviating to R. Since
$2 < 4$, the IC now says that it is not reasonable for player 2 to assume that type $t'$
would ever deviate to R. Thus, $1 - \beta = 0$, so that $\beta = 1$. With this belief, however,
(LL, ud') can no longer be sustained as a perfect Bayesian equilibrium, since for this
we need $\beta \leq 2/3$. Hence, the perfect Bayesian equilibrium with $\beta \leq 2/3$ does not
survive the IC.
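The comparison made for each type can be sketched as a small function. The payoff numbers are the ones quoted in this paragraph; the function name `ic_excluded_types`, the variable names, and the mapping from excluded types to a forced belief are illustrative choices for this two-type example, not a general implementation.

```python
from fractions import Fraction

def ic_excluded_types(eq_payoff, deviation_payoffs):
    """Types to which the intuitive criterion assigns probability zero.

    eq_payoff[typ]         -- equilibrium payoff v of that type
    deviation_payoffs[typ] -- payoffs of that type after the unused action,
                              one entry per possible reply of player 2
    A type is excluded when its maximal deviation payoff m satisfies m < v.
    """
    return {typ for typ, v in eq_payoff.items()
            if max(deviation_payoffs[typ]) < v}

# Numbers from the text for (LL, ud'), with R as the unused action.
eq_payoff = {"t": 2, "t'": 4}
deviation_payoffs = {"t": [4, 0], "t'": [2, 2]}  # replies u' and d'

excluded = ic_excluded_types(eq_payoff, deviation_payoffs)

# beta is the probability of type t after R. Excluding t' forces beta = 1,
# excluding t forces beta = 0. The result is None when no type is excluded
# (no restriction) or when all types are excluded (the IC does not apply).
forced_beta = {frozenset({"t'"}): Fraction(1),
               frozenset({"t"}): Fraction(0)}.get(frozenset(excluded))

# Sequential rationality of d' requires beta <= 2/3 in this equilibrium.
survives = forced_beta is None or forced_beta <= Fraction(2, 3)
print(excluded, forced_beta, survives)
```

Here only type t' is excluded, the forced belief is 1, and the pooling equilibrium fails the test, as derived above.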


5.3.4  Another  Example 

Consider  the  signaling  game  in  Fig.  5.6.  We  compute  the  pure  strategy  perfect 
Bayesian  equilibria  of  this  game  by  considering  the  strategies  of  player  1  one  by 
one. 

Suppose player 1 plays LL (the first letter refers to type $t$ and the second to type
$t'$). Then $\alpha = 1/2$ by Bayesian consistency, so player 2 is indifferent between u and
d. If player 2 plays u, however, then type $t'$ obtains 0 and will therefore deviate to
R, so that he obtains at least 1. Hence, player 2 should play d following L in order
to get an equilibrium. Then, again to keep type $t'$ from deviating, player 2 should
play u' following R. This is indeed optimal for player 2 for $\beta \geq 1/2$. Obviously,
type $t$ will not deviate to R, so (LL, du') with $\alpha = 1/2$ and $\beta \geq 1/2$ is a (pooling)
perfect Bayesian equilibrium. In this equilibrium, type $t$ obtains 3 and by deviating
to R obtains maximally 2. Type $t'$ obtains 2 in equilibrium and by deviating to R
maximally 3. Hence, the IC prescribes $\beta = 0$, but for this belief (LL, du') does
not result in a perfect Bayesian equilibrium. Hence, (LL, du') with $\alpha = 1/2$ and
$\beta \geq 1/2$ does not survive the IC.

Fig. 5.6 Another signaling game

Suppose player 1 plays LR. Then the best reply of player 2 is ud', but then
type $t$ will deviate to R. So there is no perfect Bayesian equilibrium (or even Nash
equilibrium) in which player 1 plays LR.

Suppose player 1 plays RL. Then the best reply of player 2 is du', but then
type $t$ will deviate to L. So there is no perfect Bayesian equilibrium (or even Nash
equilibrium) in which player 1 plays RL.

Finally, suppose player 1 plays RR. Then $\beta = 1/2$ and player 2 is indifferent
between u' and d' following R. If player 2 plays u' then type $t$ of player 1 will deviate
to L, where he always obtains more than 0. Hence in an equilibrium player 2 should
play d'. Again to keep type $t$ from deviating, player 2 should play u following L. This
is indeed optimal for player 2 if $\alpha \geq 1/2$. Hence, (RR, ud') with beliefs $\beta = 1/2$ and
$\alpha \geq 1/2$ is a (pooling) perfect Bayesian equilibrium. Type $t$ obtains 2 in equilibrium
and maximally 3 by deviating to L. Type $t'$ obtains 3 in equilibrium and maximally
2 by deviating to L. Thus, the IC prescribes $1 - \alpha = 0$ or $\alpha = 1$, and therefore this
perfect Bayesian equilibrium survives the IC for $\alpha = 1$.


5.4  Problems 

5.1.  Battle-of-the-Sexes

Draw the extensive form of the Battle-of-the-Sexes game in Sect. 5.2 with payoffs
in Fig. 5.3 and type probabilities in Table 5.1. Compute the strategic form and find
the pure strategy Nash equilibria of the game.

5.2.  A  Static  Game  of  Incomplete  Information

Compute  all  pure  strategy  Nash  equilibria  in  the  following  static  game  of  incomplete 
information: 


1.  Chance  determines  whether  the  payoffs  are  as  in  Game  1  or  as  in  Game  2,  each 
game  being  equally  likely. 

2.  Player  1  learns  which  game  has  been  chosen  but  player  2  does  not. 


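The two bimatrix games are:

\[
\text{Game 1:} \quad
\begin{array}{c|cc}
  & L & R \\ \hline
T & 1,1 & 0,0 \\
B & 0,0 & 0,0
\end{array}
\qquad\qquad
\text{Game 2:} \quad
\begin{array}{c|cc}
  & L & R \\ \hline
T & 0,0 & 0,0 \\
B & 0,0 & 2,2
\end{array}
\]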

5.3.  Another  Static  Game  of  Incomplete  Information 

Player 1 has two types, $t_1$ and $t_1'$, and player 2 has two types, $t_2$ and $t_2'$. The
conditional probabilities of these types are:

\[ p(t_2 \mid t_1) = 1, \quad p(t_2 \mid t_1') = 3/4, \quad p(t_1 \mid t_2) = 3/4, \quad p(t_1 \mid t_2') = 0 . \]


(a)  Show  that  these  conditional  probabilities  can  be  derived  from  a  common 
distribution  p  over  the  four  type  combinations,  and  determine  p. 


As  usual  suppose  that  each  player  learns  his  own  type  and  knows  the  conditional 
probabilities  above.  Then  player  1  chooses  between  T  and  B  and  player  2  between  L 
and  R ,  where  these  actions  may  be  contingent  on  the  information  a  player  has.  The 
payoffs  for  the  different  type  combinations  are  given  by  the  bimatrix  games 

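\[
t_1 t_2: \quad
\begin{array}{c|cc}
  & L & R \\ \hline
T & 2,2 & 0,0 \\
B & 3,0 & 1,1
\end{array}
\qquad
t_1' t_2: \quad
\begin{array}{c|cc}
  & L & R \\ \hline
T & 2,2 & 0,0 \\
B & 0,0 & 1,1
\end{array}
\qquad
t_1' t_2': \quad
\begin{array}{c|cc}
  & L & R \\ \hline
T & 2,2 & 0,0 \\
B & 0,0 & 1,1
\end{array}
\]

where the type combination $(t_1, t_2')$ is left out since it has zero probability.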

(b)  Compute  all  pure  strategy  Nash  equilibria  for  this  game. 


5.4.  Job-Market  Signaling 

A worker can have either high or low ability, where the probability of high ability
is equal to 2/5. A worker knows his ability, but a firm which wants to hire the
worker does not. The worker, whether a high or a low ability type, can choose
between additional education or not. Choosing additional education does not enlarge
the worker's productivity but may serve as a signal to the firm: a high ability
worker can choose education without additional costs, whereas for a low ability
worker the cost of education equals e > 0. The firm chooses either a high
or a low wage, having observed whether the worker took additional education
or not. The payoff to the firm equals the productivity of the worker minus the
wage. The payoff to the worker equals the wage minus the cost of education;
if, however, this payoff is lower than the worker's reservation utility, he chooses
not to work at all and to receive his reservation utility, leaving the firm with 0
payoff. Denote the productivities of the high and low ability worker by $p^H$ and
$p^L$, respectively, and denote the high and low wages by $w^h$ and $w^l$. Finally, let $r^H$
and $r^L$ denote the reservation utilities of both worker types. (All these numbers are
fixed.)

(a) Determine the extensive form of this game.

(b) Choose $p^H = 10$, $p^L = 8$, $w^h = 6$, $w^l = 3$, $r^H = 4$, $r^L = 0$, $e = 4$. Compute
the strategic form of this game, and determine the pure strategy Nash equilibria.
Also compute the perfect Bayesian equilibrium or equilibria in pure strategies,
determine whether they are separating or pooling and whether they survive
the IC.

5.5.  A  Joint  Venture 

Software  Inc.  and  Hardware  Inc.  are  in  a  joint  venture  together.  The  parts  used 
in  the  joint  product  can  be  defective  or  not;  the  probability  of  defective  parts  is 
0.7,  and  this  is  commonly  known  before  the  start  of  the  game.  Each  can  exert 
either  high  or  low  effort,  which  is  equivalent  to  costs  of  20  and  0.  Hardware 
moves  first,  but  software  cannot  observe  his  effort.  Revenues  are  split  equally 
at  the  end.  If  both  firms  exert  low  effort,  total  profits  are  100.  If  the  parts  are 
defective,  the  total  profit  is  100;  otherwise  (i.e.,  if  the  parts  are  not  defective), 
if  both  exert  high  effort,  profit  is  200,  but  if  only  one  player  does,  profit  is 
100  with  probability  0.9  and  200  with  probability  0.1.  Hardware  discovers  the 
truth  about  the  parts  by  observation  before  he  chooses  effort,  but  software  does 
not. 

(a)  Determine  the  extensive  form  of  this  game.  Is  this  a  signaling  game? 

(b)  Determine  the  strategic  form  of  this  game. 

(c) Compute the (pure) Nash equilibria. Which one(s) is (are) subgame perfect?
Perfect Bayesian?

5.6.  Entry  Deterrence 

The entry deterrence game of Chap. 1 is reproduced in Fig. 5.7. For this game,
compute the pure strategy perfect Bayesian equilibria for every value of $x \in \mathbb{R}$.
Which one(s) is (are) pooling or separating? Which satisfy the intuitive criterion?


Fig. 5.7 The entry deterrence game of Problem 5.6

5.7.  The  Beer-Quiche  Game

Consider the following two-player signaling game. Player 1 is either 'weak' or
'strong'. This is determined by a chance move, resulting in player 1 being 'weak'
with probability 1/10. Player 1 is informed about the outcome of this chance
move but player 2 is not; but the probabilities of either type of player 1 are
common knowledge among the two players. Player 1 has two actions: either have
quiche (Q) or have beer (B) for breakfast. Player 2 observes the breakfast of
player 1 and then decides to duel (D) or not to duel (N) with player 1. The
payoffs are as follows. If player 1 is weak and eats quiche then D and N give
him payoffs of 1 and 3, respectively; if he is weak and drinks beer, then these
payoffs are 0 and 2, respectively. If player 1 is strong, then the payoffs are 0
and 2 from D and N, respectively, if he eats quiche; and 1 and 3 from D and N,
respectively, if he drinks beer. Player 2 has payoff 0 from not duelling, payoff 1
from duelling with the weak player 1, and payoff -1 from duelling with the strong
player 1.

(a)  Draw  a  diagram  modelling  this  situation. 

(b)  Compute  all  the  pure  strategy  Nash  equilibria  of  the  game.  Find  out  which  of 
these  Nash  equilibria  are  perfect  Bayesian  equilibria.  Give  the  corresponding 
beliefs  and  determine  whether  these  equilibria  are  pooling  or  separating,  and 
which  ones  satisfy  the  intuitive  criterion. 

5.8.  Issuing  Stock 

In this story the players are a manager (M) and an existing shareholder (O). The
manager is informed about the current value of the firm, $a$, and the NPV (net present
value) of a potential investment opportunity, $b$, but the shareholder only knows
that high values and low values each have probability 1/2. More precisely, either
$(a, b) = (\bar{a}, \bar{b})$ or $(a, b) = (\underline{a}, \underline{b})$, each with probability 1/2, where
$\underline{a} < \bar{a}$ and $\underline{b} < \bar{b}$. The manager moves first and either proposes to issue new
stock $E$ (where $E$ is fixed) to undertake the investment opportunity, or decides not
to issue new stock. The existing shareholder decides whether to approve of the new
stock issue or not. The manager always acts in the interest of the existing
shareholder: their payoffs in the game are always equal.

If the manager decides not to issue new stock, then the investment opportunity
is foregone, and the payoff is either $\bar{a}$ or $\underline{a}$. If the manager proposes to issue new
stock but this is not approved by the existing shareholder, then again the investment
opportunity is foregone and the payoff is either $\bar{a}$ or $\underline{a}$. If the manager proposes to
issue new stock $E$ and the existing shareholder approves of this, then the payoff to
the existing shareholder is equal to $[M/(M + E)](\bar{a} + \bar{b} + E)$ in the good state $(\bar{a}, \bar{b})$
and $[M/(M + E)](\underline{a} + \underline{b} + E)$ in the bad state $(\underline{a}, \underline{b})$; here,
$M = (1/2)[\bar{a} + \bar{b}] + (1/2)[\underline{a} + \underline{b}]$ is the price of the existing shares if the
investment is undertaken.

(a)  Set  up  the  extensive  form  of  this  signaling  game. 

(b) Take $\bar{a} = 150$, $\underline{a} = 50$, $\bar{b} = 20$, $\underline{b} = 10$, and $E = 100$. Compute the pure
strategy perfect Bayesian equilibria of this game. Are they pooling, separating?
How about the intuitive criterion? Try to interpret the results from an economic
point of view.

(c) Repeat the analysis of (b) for $\bar{b} = 100$.


5.9.  More  Signaling  Games 

(a) Consider the signaling game in Fig. 5.4, but with payoffs (1,2) instead of (4,2)
if type t of player 1 plays R and player 2 plays u'. Show that this game has only
pooling equilibria. Which ones survive the IC?

(b)  Compute  the  pure  strategy  perfect  Bayesian  equilibria  and  test  for  the  intuitive 
criterion  in  the  signaling  game  in  Fig.  5.8. 

(c)  Consider  the  signaling  game  in  Fig.  5.9,  where  the  chance  move  is  not  explicitly 
drawn  in  order  to  keep  the  diagram  simple.  Compute  the  pure  strategy  perfect 
Bayesian  equilibria  and  test  for  the  intuitive  criterion. 


Fig. 5.8 The signaling game of Problem 5.9(b)

Fig. 5.9 The signaling game of Problem 5.9(c). Each type of player 1 has probability 1/3


5.5  Notes 

The  type  terminology  and  corresponding  theory  is  due  to  Harsanyi  (1967/1968). 
The  Battle  of  the  Sexes  examples  in  Sect.  5.2  are  taken  from  Osborne  (2004).  One 
of  the  first  examples  of  a  signaling  game  is  the  Spence  (1973)  job  market  signaling 
model  (see  Problem  5.4).  The  intuitive  criterion  is  due  to  Cho  and  Kreps  (1987). 

Problem 5.5 is taken from Rasmusen (1989). The beer-quiche game of Problem 5.7
is from Cho and Kreps (1987). Problem 5.8 is based on Myers and Majluf (1984).
6 Noncooperative Games: Extensions

In Chaps. 2-5 we have studied noncooperative games in which the players have
finitely many (pure) strategies. The reason for the finiteness restriction is that in such
games special results hold, such as the existence of a value and optimal strategies
for two-person zero-sum games, and the existence of a Nash equilibrium in mixed
strategies for finite nonzero-sum games.

The basic game-theoretical concepts discussed in these chapters can be applied to
more general games. Once, in a game-theoretic situation, the players, their possible
strategies, and the associated payoffs are identified, the concepts of best reply and of
Nash equilibrium can be applied. Also the concepts of backward induction, subgame
perfection, and perfect Bayesian equilibrium carry over to quite general extensive
form games. In games of incomplete information, the concept of player types and
the associated Nash equilibrium (Bayesian Nash equilibrium) can be applied also if
the game has infinitely many strategies.

The bulk of this chapter consists of diverse examples verifying these claims.
The main objective of the chapter is, indeed, to show how the basic game-theoretic
apparatus can be applied to various different conflict situations; and, of course, to
show these applications themselves.

In Sect. 6.1 we generalize some of the concepts of Chaps. 2 and 3. This section
serves only as background and general framework for the examples in the following
sections; most of the remainder of this chapter can also be understood without
this general framework and the reader may choose to postpone reading it. Concepts
specific to extensive form games and to incomplete information games are adapted
later, when they are applied. In Sects. 6.2-6.7 we discuss Cournot competition
with complete and incomplete information, Bertrand competition, Stackelberg
equilibrium, auctions with complete and incomplete information, mixed strategies
with objective probabilities, and sequential bargaining. Variations on these topics
and various other topics are treated in the problem section.


6.1  General  Framework:  Strategic  Games 

An n-person strategic game is a $(2n + 1)$-tuple

$$G = (N, S_1, \ldots, S_n, u_1, \ldots, u_n),$$

where

• $N = \{1, \ldots, n\}$, with $n \geq 1$, is the set of players;

• for every $i \in N$, $S_i$ is the strategy set of player $i$;

• for every $i \in N$, $u_i : S = S_1 \times \ldots \times S_n \to \mathbb{R}$ is the payoff function of player $i$;
i.e., for every strategy combination $(s_1, \ldots, s_n) \in S$ where $s_1 \in S_1, \ldots, s_n \in S_n$,
$u_i(s_1, \ldots, s_n) \in \mathbb{R}$ is player $i$'s payoff.


A best reply of player $i$ to the strategy combination $(s_1, \ldots, s_{i-1}, s_{i+1}, \ldots, s_n)$ of the
other players is a strategy $s_i \in S_i$ such that

$$u_i(s_1, \ldots, s_{i-1}, s_i, s_{i+1}, \ldots, s_n) \geq u_i(s_1, \ldots, s_{i-1}, s_i', s_{i+1}, \ldots, s_n)$$

for all $s_i' \in S_i$.

A Nash equilibrium of $G$ is a strategy combination $(s_1^*, \ldots, s_n^*) \in S$ such that for
each player $i$, $s_i^*$ is a best reply to $(s_1^*, \ldots, s_{i-1}^*, s_{i+1}^*, \ldots, s_n^*)$.

A strategy $s_i' \in S_i$ of player $i$ is strictly dominated by $s_i \in S_i$ if

$$u_i(s_1, \ldots, s_{i-1}, s_i, s_{i+1}, \ldots, s_n) > u_i(s_1, \ldots, s_{i-1}, s_i', s_{i+1}, \ldots, s_n)$$

for all $(s_1, \ldots, s_{i-1}, s_{i+1}, \ldots, s_n) \in S_1 \times \ldots \times S_{i-1} \times S_{i+1} \times \ldots \times S_n$, i.e., for all
strategy combinations of players other than $i$. Clearly, a strictly dominated strategy
is never used in a Nash equilibrium.

Finally we define weak domination. A strategy $s_i' \in S_i$ of player $i$ is weakly
dominated by $s_i \in S_i$ if

$$u_i(s_1, \ldots, s_{i-1}, s_i, s_{i+1}, \ldots, s_n) \geq u_i(s_1, \ldots, s_{i-1}, s_i', s_{i+1}, \ldots, s_n)$$

for all $(s_1, \ldots, s_{i-1}, s_{i+1}, \ldots, s_n) \in S_1 \times \ldots \times S_{i-1} \times S_{i+1} \times \ldots \times S_n$, such that at
least once this inequality is strict.

The  reader  may  verify  that  matrix  games  (Chap.  2)  and  bimatrix  games  (Chap.  3) 
are  special  cases  of  this  general  framework,  in  which  the  set  Si  is  the  set  of  all  mixed 
strategies  of  player  i.  The  same  is  true  for  the  concepts  of  Nash  equilibrium  and 
domination  discussed  in  these  chapters. 
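
For finite strategy sets, the definitions of this section can be checked mechanically. The following Python fragment is only a sketch under that finiteness assumption: the function names and the small prisoners' dilemma used as data are illustrative and not taken from the text. It enumerates all strategy combinations, keeps those in which every player plays a best reply, and tests strict domination directly from the definition.

```python
from itertools import product

def pure_nash_equilibria(strategy_sets, payoff):
    """Return all strategy combinations in which each player plays a best reply.

    strategy_sets: one list of strategies per player (assumed finite).
    payoff: function (i, profile) -> payoff u_i of player i at that profile.
    """
    equilibria = []
    for profile in product(*strategy_sets):
        def is_best_reply(i):
            current = payoff(i, profile)
            for alt in strategy_sets[i]:
                deviation = profile[:i] + (alt,) + profile[i + 1:]
                if payoff(i, deviation) > current:
                    return False
            return True
        if all(is_best_reply(i) for i in range(len(strategy_sets))):
            equilibria.append(profile)
    return equilibria

def strictly_dominated(i, s_prime, s, strategy_sets, payoff):
    """True if strategy s_prime of player i is strictly dominated by s."""
    others = [S for j, S in enumerate(strategy_sets) if j != i]
    for rest in product(*others):
        with_s = rest[:i] + (s,) + rest[i:]
        with_s_prime = rest[:i] + (s_prime,) + rest[i:]
        if not payoff(i, with_s) > payoff(i, with_s_prime):
            return False
    return True

# Hypothetical example data: a 2x2 prisoners' dilemma with illustrative payoffs.
PD = {("C", "C"): (3, 3), ("C", "D"): (0, 4),
      ("D", "C"): (4, 0), ("D", "D"): (1, 1)}
pd_sets = [["C", "D"], ["C", "D"]]
pd_payoff = lambda i, profile: PD[profile][i]
```

In this example `pure_nash_equilibria(pd_sets, pd_payoff)` returns only the combination `("D", "D")`, and `"C"` is strictly dominated by `"D"` for each player, in line with the remark that a strictly dominated strategy is never used in a Nash equilibrium. For the infinite strategy sets of the following sections such enumeration is not available and the best replies are computed analytically.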
6.2 Cournot Quantity Competition

6.2.1  Simple  Version  with  Complete  Information 

In  the  simplest  version  of  the  famous  Cournot  model,  two  firms  producing  a 
homogenous  good  compete  in  quantity.  Each  firm  offers  a  quantity  of  this  good  on 
the  market.  The  price  of  the  good  depends  on  the  total  quantity  offered:  the  higher 
this  quantity  is,  the  lower  the  price  of  the  good.  The  profit  for  each  firm  is  equal  to 
total  revenue  (price  times  quantity)  minus  total  cost.  This  gives  rise  to  a  two-person 
game  in  which  the  players  are  the  firms,  the  players’  strategies  are  the  quantities 
offered  and  the  payoff  functions  are  the  profit  functions.  In  a  simple  version,  the 
price  depends  linearly  on  total  quantity  and  marginal  cost  is  constant  and  positive 
while  there  are  no  fixed  costs.  Specifically,  we  study  the  following  game. 

(a) The set of players is $N = \{1, 2\}$.

(b) Each player $i = 1, 2$ has set of strategies $S_i = [0, \infty)$, with typical element $q_i$.

(c) The payoff function of player $i$ is $\Pi_i(q_1, q_2) = q_i P(q_1, q_2) - c q_i$, for all
$q_1, q_2 \geq 0$. Here,

$$P(q_1, q_2) = \begin{cases} a - q_1 - q_2 & \text{if } q_1 + q_2 \leq a \\ 0 & \text{if } q_1 + q_2 > a \end{cases}$$

is the market price of the good, where $a$ is a constant, and $c$ is marginal cost,
with $a > c > 0$.


A Nash equilibrium in this game is a pair $(q_1^C, q_2^C)$, with $q_1^C, q_2^C \geq 0$, of mutually
best replies, that is,

$$\Pi_1(q_1^C, q_2^C) \geq \Pi_1(q_1, q_2^C), \quad \Pi_2(q_1^C, q_2^C) \geq \Pi_2(q_1^C, q_2) \quad \text{for all } q_1, q_2 \geq 0.$$

This equilibrium is also called Cournot equilibrium. To find the equilibrium, we
first compute the best reply functions, also called reaction functions. The reaction
function $\beta_1(q_2)$ of player 1 is found by solving the maximization problem

$$\max_{q_1 \geq 0} \Pi_1(q_1, q_2)$$

for each given value of $q_2 \geq 0$. If $q_2 > a$ then $P(q_1, q_2) = 0$ for every $q_1$ and the
profit of firm 1 is equal to $-c q_1$ so that, clearly, the maximum is attained by setting
$q_1 = 0$. For $q_2 \leq a$ we have

$$\Pi_1(q_1, q_2) = \begin{cases} q_1(a - q_1 - q_2) - c q_1 & \text{if } q_1 \leq a - q_2 \\ -c q_1 & \text{if } q_1 > a - q_2 \end{cases}$$

so that the maximum is attained for some $q_1 \leq a - q_2$. In that case, the function

$$q_1(a - q_1 - q_2) - c q_1 = q_1(a - c - q_1 - q_2)$$

has to be maximized with respect to $q_1 \geq 0$. If $q_2 > a - c$ then the maximum is
attained for $q_1 = 0$. If $q_2 \leq a - c$ then we compute the maximum by setting the
derivative with respect to $q_1$, namely the function $a - 2q_1 - q_2 - c$, equal to zero. This
yields $q_1 = (a - c - q_2)/2$. (The second derivative is equal to $-2$ so that, indeed,
we have a maximum.) Summarizing, we have


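$$\beta_1(q_2) = \begin{cases} \left\{ \frac{a - c - q_2}{2} \right\} & \text{if } q_2 \leq a - c \\ \{0\} & \text{if } q_2 > a - c. \end{cases}$$

Since this reaction function is single-valued for all $q_2$, we can omit the braces at the
right-hand side and write

$$\beta_1(q_2) = \begin{cases} \frac{a - c - q_2}{2} & \text{if } q_2 \leq a - c \\ 0 & \text{if } q_2 > a - c. \end{cases} \tag{6.1}$$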


By symmetric arguments we obtain for the reaction function of player 2:

$$\beta_2(q_1) = \begin{cases} \frac{a - c - q_1}{2} & \text{if } q_1 \leq a - c \\ 0 & \text{if } q_1 > a - c. \end{cases} \tag{6.2}$$

These reaction functions are drawn in Fig. 6.1. The Nash equilibrium is the point of
intersection of the reaction functions. It is obtained by simultaneously solving the
two equations $q_1 = (a - c - q_2)/2$ and $q_2 = (a - c - q_1)/2$, resulting in

$$q_1^C = q_2^C = \frac{a - c}{3}.$$
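
The intersection of the two reaction functions can also be approximated numerically. The following Python sketch assumes illustrative parameter values and uses repeated simultaneous best replies, which is a device of this example and not a method used in the text; in this linear model the iteration happens to converge to the equilibrium quantities $(a - c)/3$.

```python
def cournot_best_reply(q_other, a, c):
    """Reaction function (6.1)/(6.2): best reply to the opponent's quantity."""
    return (a - c - q_other) / 2 if q_other <= a - c else 0.0

def cournot_profit(q_own, q_other, a, c):
    """Payoff q_i * P(q_1, q_2) - c * q_i with P = max(a - q_1 - q_2, 0)."""
    price = max(a - q_own - q_other, 0.0)
    return q_own * price - c * q_own

def cournot_equilibrium(a, c, iterations=200):
    """Apply both reaction functions repeatedly, starting from (0, 0)."""
    q1, q2 = 0.0, 0.0
    for _ in range(iterations):
        # Both firms reply simultaneously to the previous quantities.
        q1, q2 = cournot_best_reply(q2, a, c), cournot_best_reply(q1, a, c)
    return q1, q2

# Illustrative parameters (an assumption of this sketch): a = 10, c = 1.
q1_c, q2_c = cournot_equilibrium(10.0, 1.0)
```

With $a = 10$ and $c = 1$ both quantities approach $(a - c)/3 = 3$, and `cournot_profit(q1_c, q2_c, 10.0, 1.0)` is then approximately $(a - c)^2/9 = 9$.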


Fig. 6.1 The Cournot model: the thick black piecewise linear curve is the reaction function of
player 1 and the thick gray piecewise linear curve is the reaction function of player 2. The point
$q^C$ is the Nash-Cournot equilibrium. The two isoprofit curves of the players through the Nash
equilibrium are drawn. The curve intersecting the $q_1$-axis is the isoprofit curve of player 1: profit
increases if this curve shifts downwards. The curve intersecting the $q_2$-axis is the isoprofit curve
of player 2: profit increases if this curve shifts leftwards. The shaded area consists of the quantity
combinations that Pareto dominate the equilibrium


6.2.1.1 Pareto Optimality

A pair $(q_1, q_2)$ of strategies is Pareto optimal if there is no other pair $(q_1', q_2')$
such that the associated payoffs are at least as good for both players and strictly
better for at least one player. Not surprisingly, the equilibrium $(q_1^C, q_2^C)$ is not
Pareto optimal. For instance, both players can strictly benefit from joint profit
maximization, attained by solving the problem

$$\max_{q_1, q_2 \geq 0} \Pi_1(q_1, q_2) + \Pi_2(q_1, q_2).$$

This amounts to solving the maximization problem

$$\max_{q_1, q_2 \geq 0} (q_1 + q_2)(a - c - q_1 - q_2)$$

or, writing $Q = q_1 + q_2$,

$$\max_{Q \geq 0} Q(a - c - Q)$$

which yields $Q = (a - c)/2$. Observe that this is just the monopoly quantity. Thus,
joint profit maximization is attained by any pair $(q_1, q_2) \geq 0$ with $q_1 + q_2 = (a - c)/2$.
Taking, in particular, $q_1 = q_2 = (a - c)/4$ yields each player a profit of
$(a - c)^2/8$, whereas in the Nash equilibrium each player obtains $(a - c)^2/9$. See
also Fig. 6.1, where all points in the gray-shaded area ‘Pareto dominate’ the Nash
equilibrium: the associated payoffs are at least as good for both agents and better
for at least one agent.
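
The comparison between the equilibrium and the joint optimum can be checked with a few lines of Python. This is a minimal sketch with assumed parameter values ($a = 10$, $c = 1$); it also illustrates why the joint optimum is not an equilibrium, since each firm's best reply (6.1) to the other firm's share yields a strictly higher profit.

```python
def profit(q_own, q_other, a, c):
    """Cournot payoff q_i * P(q_1, q_2) - c * q_i with P = max(a - q_1 - q_2, 0)."""
    price = max(a - q_own - q_other, 0.0)
    return q_own * price - c * q_own

a, c = 10.0, 1.0                  # illustrative parameters with a > c > 0
q_nash = (a - c) / 3              # Cournot equilibrium quantity of each firm
q_joint = (a - c) / 4             # equal split of the monopoly quantity

nash_profit = profit(q_nash, q_nash, a, c)      # equals (a - c)^2 / 9
joint_profit = profit(q_joint, q_joint, a, c)   # equals (a - c)^2 / 8

# Unilateral best reply (6.1) to the other firm's share of the joint optimum.
deviation = (a - c - q_joint) / 2
deviation_profit = profit(deviation, q_joint, a, c)
```

For these values `nash_profit` is 9 and `joint_profit` is 10.125, while `deviation_profit` exceeds `joint_profit`: the pair $((a - c)/4, (a - c)/4)$ Pareto dominates the equilibrium but is not itself a pair of mutually best replies.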


6.2.2  Simple  Version  with  Incomplete  Information 

Consider the Cournot model of Sect. 6.2.1 but now assume that the marginal cost of
firm 2 is either high, $c_H$, or low, $c_L$, where $c_H > c_L > 0$. Firm 2 knows its marginal
cost but firm 1 only knows that it is $c_H$ with probability $\vartheta$ and $c_L$ with probability
$1 - \vartheta$. The cost of firm 1 is $c > 0$ and this is commonly known. In the terminology
of Sect. 5.1, player 1 has only one type but player 2 has two types, $c_H$ and $c_L$. The
associated game is as follows.
(a) The player set is $\{1, 2\}$.

(b) The strategy set of player 1 is $[0, \infty)$ with typical element $q_1$, and the strategy
set of player 2 is $[0, \infty) \times [0, \infty)$ with typical element $(q_H, q_L)$. Here, $q_H$ is the
chosen quantity if player 2 has type $c_H$, and $q_L$ is the chosen quantity if player
2 has type $c_L$.

(c) The payoff functions of the players are the expected payoff functions. These are

$$\Pi_i(q_1, q_H, q_L) = \vartheta \Pi_i(q_1, q_H) + (1 - \vartheta) \Pi_i(q_1, q_L),$$

for $i = 1, 2$, where $\Pi_i(\cdot, \cdot)$ is the payoff function from the Cournot model of
Sect. 6.2.1, with firm 2's marginal cost equal to $c_H$ in $\Pi_2(q_1, q_H)$ and to $c_L$ in
$\Pi_2(q_1, q_L)$.

To find the (Bayesian) Nash equilibrium, we first compute the best reply function or
reaction function of player 1, by maximizing $\Pi_1(q_1, q_H, q_L)$ over $q_1 \geq 0$, with $q_H$
and $q_L$ regarded as given. Hence, we solve the problem

$$\max_{q_1 \geq 0} \; \vartheta [q_1(a - c - q_1 - q_H)] + (1 - \vartheta)[q_1(a - c - q_1 - q_L)].$$

Assuming $q_H, q_L \leq a - c$ (this has to be checked later for the equilibrium), this
problem is solved by setting the derivative with respect to $q_1$ equal to zero, which
yields

$$q_1 = q_1(q_H, q_L) = \frac{a - c - \vartheta q_H - (1 - \vartheta) q_L}{2}. \tag{6.3}$$

Observe that, compared to (6.1), we now have the expected quantity $\vartheta q_H + (1 - \vartheta) q_L$
instead of $q_2$: this is due to the linearity of the model.

For player 2, we consider, for given $q_1$, the problem

$$\max_{q_H, q_L \geq 0} \; \vartheta [q_H(a - c_H - q_1 - q_H)] + (1 - \vartheta)[q_L(a - c_L - q_1 - q_L)].$$

Since the first term in this function depends only on $q_H$ and the second term only
on $q_L$, solving this problem amounts to maximizing the two terms separately. In
other words, we determine the best replies of types $c_H$ and $c_L$ separately. Assuming
$q_1 \leq a - c_H$ (and hence $q_1 \leq a - c_L$) this results in

$$q_H = q_H(q_1) = \frac{a - c_H - q_1}{2} \tag{6.4}$$

and

$$q_L = q_L(q_1) = \frac{a - c_L - q_1}{2}. \tag{6.5}$$

1 This is generally so in a Bayesian, incomplete information game: maximizing the expected payoff
of a player over all his types is equivalent to maximizing the payoff per type.

The Nash equilibrium is obtained by simultaneously solving (6.3)-(6.5) (using
substitution or Gaussian elimination). The solution is the triple

$$q_1 = \frac{a - 2c + \vartheta c_H + (1 - \vartheta) c_L}{3}$$

$$q_H = \frac{a - 2c_H + c}{3} + \frac{1 - \vartheta}{6}(c_H - c_L)$$

$$q_L = \frac{a - 2c_L + c}{3} - \frac{\vartheta}{6}(c_H - c_L).$$

Assuming that the parameters of the game are such that these three values are
nonnegative and that $q_1 \leq a - c_H$ and $q_H, q_L \leq a - c$, this is the Bayesian
Nash-Cournot equilibrium of the game. This solution may be compared with the
Nash  equilibrium  in  the  complete  information  model  with  asymmetric  costs,  see 
Problem  6.1.  The  high  cost  type  of  firm  2  produces  more  than  it  would  in  the 
complete  information  case:  it  benefits  from  the  fact  that  firm  1  is  unsure  about  the 
cost  of  firm  2  and  therefore  produces  less  than  it  would  if  it  knew  for  sure  that  firm 
2  had  high  costs.  Similarly,  the  low  cost  firm  2  produces  less. 
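
The closed-form solution can be verified by substituting it back into the three best reply conditions (6.3)-(6.5). The Python sketch below does this for assumed parameter values; the function and variable names, with `theta` standing for the probability $\vartheta$, are illustrative. For the comparison it uses the standard complete information quantity $(a - 2c_2 + c_1)/3$ of a firm 2 with cost $c_2$ facing a firm 1 with cost $c_1$, which is the kind of result Problem 6.1 asks for.

```python
def bayesian_cournot(a, c, c_high, c_low, theta):
    """Closed-form solution of (6.3)-(6.5); theta = Prob(firm 2 has cost c_high)."""
    q1 = (a - 2 * c + theta * c_high + (1 - theta) * c_low) / 3
    q_high = (a - 2 * c_high + c) / 3 + (1 - theta) / 6 * (c_high - c_low)
    q_low = (a - 2 * c_low + c) / 3 - theta / 6 * (c_high - c_low)
    return q1, q_high, q_low

def residuals(a, c, c_high, c_low, theta, q1, q_high, q_low):
    """Gap between each quantity and its best reply; all zero in equilibrium."""
    r1 = q1 - (a - c - theta * q_high - (1 - theta) * q_low) / 2   # (6.3)
    r_high = q_high - (a - c_high - q1) / 2                        # (6.4)
    r_low = q_low - (a - c_low - q1) / 2                           # (6.5)
    return r1, r_high, r_low

# Illustrative parameter values (an assumption of this sketch).
params = dict(a=10.0, c=2.0, c_high=3.0, c_low=1.0, theta=0.5)
q1, q_high, q_low = bayesian_cournot(**params)
gaps = residuals(q1=q1, q_high=q_high, q_low=q_low, **params)

# Complete information benchmark for firm 2: (a - 2*c_2 + c_1) / 3.
q_high_complete = (params["a"] - 2 * params["c_high"] + params["c"]) / 3
q_low_complete = (params["a"] - 2 * params["c_low"] + params["c"]) / 3
```

For these values all three gaps vanish up to rounding, `q_high` exceeds `q_high_complete` and `q_low` is below `q_low_complete`, as described above. The side conditions $q_1 \leq a - c_H$ and $q_H, q_L \leq a - c$ also hold here, but they would have to be rechecked for other parameter choices.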


6.3  Bertrand  Price  Competition 

Consider two firms who compete in the price of a homogenous good. Specifically,
assume that the demand $q$ for the good is given by $q = q(p) = \max\{a - p, 0\}$
for every price $p \geq 0$, where $a$ is a positive constant (the demand for the good if
the price is zero). The firm with the lower price serves the whole market; if prices
are equal the firms share the market equally. Each firm has the same marginal cost
$0 < c < a$, and no fixed cost. If firm 1 sets a price $p_1$ and firm 2 sets a price $p_2$, then
the profit of firm 1 is

$$\Pi_1(p_1, p_2) = \begin{cases} (p_1 - c)(a - p_1) & \text{if } p_1 < p_2 \text{ and } p_1 \leq a \\ \frac{1}{2}(p_1 - c)(a - p_1) & \text{if } p_1 = p_2 \text{ and } p_1 \leq a \\ 0 & \text{in all other cases.} \end{cases}$$

Similarly, the profit of firm 2 is

$$\Pi_2(p_1, p_2) = \begin{cases} (p_2 - c)(a - p_2) & \text{if } p_2 < p_1 \text{ and } p_2 \leq a \\ \frac{1}{2}(p_2 - c)(a - p_2) & \text{if } p_1 = p_2 \text{ and } p_2 \leq a \\ 0 & \text{in all other cases.} \end{cases}$$
Fig. 6.2 The profit function of firm i in the monopoly situation


Thus, the two firms are the players in this game, and their profit functions are the
payoff functions; the strategy sets are $[0, \infty)$ for each, with typical elements $p_1$ and
$p_2$. To find a Nash equilibrium (Bertrand equilibrium) we first compute the best
reply functions (reaction functions). An important role is played by the price that
maximizes profit if there is only one firm in the market, i.e., the monopoly price
$p^m = (a + c)/2$, obtained by solving the problem

$$\max_{p \geq 0} (p - c)(a - p).$$

Note  that  the  monopoly  profit  function  (or  the  profit  function  of  each  firm  in  the 
monopoly  situation)  is  a  quadratic  function,  and  that  profit  increases  as  the  price 
gets  closer  to  the  monopoly  price.  See  Fig.  6.2. 

To determine player 1's best reply function $\beta_1(p_2)$ we distinguish several cases.

If $p_2 < c$, then any $p_1 \leq p_2$ yields player 1 a negative payoff, whereas any
$p_1 > p_2$ yields a payoff of zero. Hence, the set of best replies in this case is the
interval $(p_2, \infty)$.

If $p_2 = c$, then any $p_1 < p_2$ yields a negative payoff for player 1, and any $p_1 \geq p_2$
yields zero payoff. So the set of best replies in this case is the interval $[c, \infty)$.

If $c < p_2 \leq p^m$, then the best reply of player 1 would be a price below $p_2$
(to obtain the whole market) and as close to the monopoly price as possible (to
maximize payoff) but such a price does not exist: for any price $p_1 < p_2$, a price in
between $p_1$ and $p_2$ would still be better. Hence, in this case the set of best replies of
player 1 is empty.

If $p_2 > p^m$ then the unique best reply of player 1 is the monopoly price $p^m$.


2  If  prices  are  in  smallest  monetary  units  this  somewhat  artificial  consequence  is  avoided.  See 
Problem  6.7. 
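Summarizing we obtain

$$\beta_1(p_2) = \begin{cases} \{p_1 \mid p_1 > p_2\} & \text{if } p_2 < c \\ \{p_1 \mid p_1 \geq c\} & \text{if } p_2 = c \\ \emptyset & \text{if } c < p_2 \leq p^m \\ \{p^m\} & \text{if } p_2 > p^m. \end{cases}$$

For player 2, similarly,

$$\beta_2(p_1) = \begin{cases} \{p_2 \mid p_2 > p_1\} & \text{if } p_1 < c \\ \{p_2 \mid p_2 \geq c\} & \text{if } p_1 = c \\ \emptyset & \text{if } c < p_1 \leq p^m \\ \{p^m\} & \text{if } p_1 > p^m. \end{cases}$$
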

The point(s) of intersection of these best reply functions can be found by making a
diagram or by direct inspection. We follow the latter method and leave the diagram
method to the reader. If $p_2 < c$ then by $\beta_1(p_2)$ a best reply $p_1$ satisfies $p_1 > p_2$. But
then, according to $\beta_2(p_1)$, we must have $p_2 = p^m$, a contradiction since $p^m > c$.
Therefore, in equilibrium, we must have $p_2 \geq c$. If $p_2 = c$, then $p_1 \geq c$; if however,
$p_1 > c$ then the only possibility is $p_2 = p^m$, a contradiction. Hence, $p_1 = c$ as well
and, indeed, $p_1 = p_2 = c$ is a Nash equilibrium. If $p_2 > c$, then the only possibility
is $p_1 = p^m$ but then $p_2$ is never a best reply. We conclude that the unique Nash
equilibrium (Bertrand equilibrium) is $p_1 = p_2 = c$.

It is also possible to establish this result without completely computing the best
reply functions. Suppose, in equilibrium, that $p_1 \neq p_2$, say $p_1 < p_2$. If $p_1 < p^m$
then player 1 can increase his payoff by setting a higher price still below $p_2$. If
$p_1 \geq p^m$ then player 2 can increase his payoff by setting a price below $p_1$, e.g.,
slightly below $p^m$ if $p_1 = p^m$ and equal to $p^m$ if $p_1 > p^m$. Hence, we must have
$p_1 = p_2$ in equilibrium. If this common price is below $c$ then each player can improve
by setting a higher price. If this common price is above $c$ then each player can
improve by setting a slightly lower price. Hence, the only possibility that remains is
$p_1 = p_2 = c$, and this is indeed an equilibrium, as can be verified directly.

A  few  remarks  on  this  equilibrium  are  in  order.  First,  it  is  again  Pareto  inferior. 
For  example,  both  firms  setting  the  monopoly  price  results  in  higher  profits.  Second, 
each firm plays a weakly dominated strategy: any price $p_i$ with $c < p_i < a$ weakly
dominates $p_i = c$, since it always results in a positive or zero profit whereas $p_i = c$
always  results  in  zero  profit.  Third,  the  Bertrand  equilibrium  is  beneficial  from  the 
point  of  view  of  the  consumers:  it  maximizes  consumer  surplus. 
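
The equilibrium and the undercutting argument can be illustrated numerically. The Python sketch below is only a sanity check under stated assumptions: the parameter values and the finite grid of candidate deviation prices are choices of this example, and a search over a grid is of course not a proof for the continuous strategy set $[0, \infty)$.

```python
def bertrand_profit(p_own, p_other, a, c):
    """Profit of a firm in the Bertrand model with demand max(a - p, 0)."""
    if p_own > a:
        return 0.0                                 # no demand at this price
    if p_own < p_other:
        return (p_own - c) * (a - p_own)           # serves the whole market
    if p_own == p_other:
        return 0.5 * (p_own - c) * (a - p_own)     # market is shared equally
    return 0.0                                     # undercut by the other firm

def profitable_deviation(p_own, p_other, a, c, grid):
    """Return some grid price that beats p_own against p_other, or None."""
    current = bertrand_profit(p_own, p_other, a, c)
    for p in grid:
        if bertrand_profit(p, p_other, a, c) > current + 1e-12:
            return p
    return None

a, c = 10.0, 2.0                                   # illustrative, 0 < c < a
grid = [k / 100 for k in range(0, 1201)]           # prices 0.00, 0.01, ..., 12.00

at_cost = profitable_deviation(c, c, a, c, grid)          # both price at cost
above_cost = profitable_deviation(5.0, 5.0, a, c, grid)   # common price above c
```

Here `at_cost` is `None`, since no candidate price improves on $p_1 = p_2 = c$, whereas `above_cost` is a price below 5 that captures the whole market. A price such as 6, between $c$ and $a$, never yields a negative profit against any price on the grid, which matches the weak domination remark above.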

See  Problem  6.3(d)-(f)  for  an  example  of  price  competition  with  heterogenous 
goods. 
6.4 Stackelberg Equilibrium

In the Cournot model of Sect. 6.2.1, the two firms move simultaneously. Consider
now the situation where firm 1 moves first, and firm 2 observes this move and
moves next. This situation has already been discussed in Chap. 1. The corresponding
extensive form game is given in Fig. 6.3. In this game, player 1 has the infinite
action/strategy set $[0, \infty)$, with typical element $q_1$. In the diagram, we use a zigzag
line to express the fact that the number of actions is infinite. Player 2 has the infinite
set of actions $[0, \infty)$ with typical element $q_2$, again represented by a zigzag line.
A strategy of player 2 assigns an action to each information set, hence to each
decision node (the game has perfect information). Since each decision node of
player 2 follows an action $q_1$ of player 1, a strategy of player 2 is a function
$s_2 : [0, \infty) \to [0, \infty)$. Hence, $q_2 = s_2(q_1)$ is the quantity that firm 2 offers if
firm 1 has offered $q_1$. Obviously, the number of strategies of player 2 is infinite as
well. The appropriate solution concept is backward induction or subgame perfect
equilibrium. The subgames of this game are the entire game and the infinite number
of one-player games starting at each decision node of player 2, i.e., following each
choice $q_1$ of player 1. Hence, the subgame perfect equilibrium can be found by
backward induction, as follows. In each subgame for player 2, that is, after each
choice $q_1$, player 2 should play optimally. This means that player 2 should play
according to the reaction function $\beta_2(q_1)$ as derived in (6.2). Then, going back
to the beginning of the game, player 1 should choose $q_1 \geq 0$ so as to maximize
$\Pi_1(q_1, \beta_2(q_1))$. In other words, player 1 takes player 2's optimal reaction into
account when choosing $q_1$. Assuming $q_1 \leq a - c$ (it is easy to verify that $q_1 > a - c$
is not optimal) player 1 maximizes the expression

$$q_1 \left( a - c - q_1 - \frac{a - c - q_1}{2} \right).$$

The maximum is obtained for $q_1 = (a - c)/2$, and thus $q_2 = \beta_2((a - c)/2) =
(a - c)/4$. Hence, the subgame perfect equilibrium of the game is:

$$q_1 = (a - c)/2, \quad q_2 = \beta_2(q_1).$$

The subgame perfect equilibrium outcome is by definition the resulting play of the
game, that is, the actions chosen on the equilibrium path in the extensive form. In
this case, the equilibrium outcome is:

$$q_1^S = (a - c)/2, \quad q_2^S = (a - c)/4.$$

Fig. 6.3 Extensive form representation of the Stackelberg game with firm 1 as the leader (player 1
chooses $q_1 \geq 0$, then player 2 chooses $q_2 \geq 0$; payoffs are $(\Pi_1(q_1, q_2), \Pi_2(q_1, q_2))$)

3 In mathematical notation the strategy set of player 2 is the set $[0, \infty)^{[0, \infty)}$.

Fig. 6.4 As before, the thick black curve is the reaction function of player 1 and the thick gray
curve is the reaction function of player 2. The point $q^S = (q_1^S, q_2^S)$ is the Stackelberg equilibrium
outcome: it is the point on the reaction curve of player 2 where player 1 maximizes profit. The
associated isoprofit curve of player 1 is drawn

The letter ‘S’ here is the first letter of ‘Stackelberg’, after whom this equilibrium
is named. More precisely, this subgame perfect equilibrium (or outcome) is called
the Stackelberg equilibrium (or outcome) with player 1 as the leader and player 2
as the follower. Check that player 1's profit in this equilibrium is higher and player
2's profit is lower than in the Cournot equilibrium $q_1^C = q_2^C = (a - c)/3$. See also
Problem 6.9.

The  Stackelberg  equilibrium  is  depicted  in  Fig.  6.4.  Observe  that  player  1,  the 
leader,  picks  the  point  on  the  reaction  curve  of  player  2  which  has  maximal  profit 
for player 1. Hence, player 2 is on his reaction curve but player 1 is not.
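
The backward induction argument translates directly into a small computation: first fix the follower's reaction function (6.2), then let the leader optimize against it. The Python sketch below uses assumed parameter values and a simple grid search over $q_1 \in [0, a - c]$, where the text uses calculus; the function names are illustrative.

```python
def follower_reply(q1, a, c):
    """Reaction function (6.2) of firm 2, the follower."""
    return (a - c - q1) / 2 if q1 <= a - c else 0.0

def profit(q_own, q_other, a, c):
    """Cournot payoff q_i * P(q_1, q_2) - c * q_i with P = max(a - q_1 - q_2, 0)."""
    price = max(a - q_own - q_other, 0.0)
    return q_own * price - c * q_own

def stackelberg_outcome(a, c, steps=10_000):
    """Grid search for the leader's optimum, given the follower's reaction."""
    candidates = [(a - c) * k / steps for k in range(steps + 1)]
    q1 = max(candidates, key=lambda q: profit(q, follower_reply(q, a, c), a, c))
    return q1, follower_reply(q1, a, c)

a, c = 10.0, 2.0                           # illustrative parameters, a > c > 0
q1_s, q2_s = stackelberg_outcome(a, c)
leader_profit = profit(q1_s, q2_s, a, c)
follower_profit = profit(q2_s, q1_s, a, c)
cournot_profit = (a - c) ** 2 / 9          # profit of each firm in the Cournot equilibrium
```

For these values the search returns quantities close to $(a - c)/2 = 4$ and $(a - c)/4 = 2$, with a leader profit near $(a - c)^2/8 = 8$ and a follower profit near $(a - c)^2/16 = 4$, so the leader earns more and the follower less than the Cournot profit $(a - c)^2/9$.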


6.5  Auctions 

An  auction  is  a  procedure  to  sell  goods  among  various  interested  parties,  such  that 
the  prices  are  determined  in  the  procedure.  Examples  range  from  selling  a  painting 
through  an  ascending  bid  auction  (English  auction)  and  selling  flowers  through  a 
descending  bid  auction  (Dutch  auction)  to  tenders  for  public  projects  and  selling 
mobile  telephone  frequencies. 

In  this  section  we  consider  a  few  simple,  classical  auction  models.  We  start  with 
first  and  second-price  sealed-bid  auctions  under  complete  information,  continue 
with  a  first-price  sealed  bid  auction  with  incomplete  information,  and  end  with  a 
double  auction  between  a  buyer  and  a  seller.  Some  variations  and  extensions  are 
discussed  in  Problems  6.10-6.14. 
6.5.1 Complete Information


Consider $n$ individuals who are interested in one indivisible object. Each individual $i$
has valuation $v_i > 0$ for the object. We assume without loss of generality
$v_1 \geq v_2 \geq \ldots \geq v_n$. In a first-price sealed-bid auction each individual submits a bid
$b_i \geq 0$ for the object: the bids are simultaneous and independent (‘sealed bids’). The
individual with the highest bid wins the auction and obtains the object at a price equal
to his own bid (‘first price’). In case there are several highest bidders, the bidder among
these with the lowest number wins the auction and pays his own bid; this is just
a tie-breaking rule, which can be replaced by alternative tie-breaking assumptions
without affecting the basic results.

This situation gives rise to a game with player set $N = \{1, 2, \ldots, n\}$, where each
player $i$ has strategy set $S_i = [0, \infty)$ with typical element $b_i$. The payoff function to
player $i$ is⁴

$$u_i(b_1, \ldots, b_n) = \begin{cases} v_i - b_i & \text{if } i = \min\{k \in N \mid b_k \geq b_j \text{ for all } j \in N\} \\ 0 & \text{otherwise.} \end{cases}$$


One Nash equilibrium in this game is the strategy combination
$(b_1, \dots, b_n) = (v_2, v_2, v_3, \dots, v_n)$. To check this one should verify that no player
has a better bid, given the bids of the other players: see Problem 6.10. In this equilibrium,
player 1 obtains the object and pays $v_2$, the second-highest valuation. Check that this is also
the outcome one would approximately expect in an auction with ascending bids
(English auction) or descending bids (Dutch auction).
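The equilibrium check for the first-price auction can be sketched numerically. The Python snippet below is only an illustration under stated assumptions: the function names (`winner`, `first_price_payoff`, `is_nash_on_grid`), the sample valuations, and the finite grid of candidate deviations are hypothetical choices, not taken from the text. A grid search cannot prove the claim (that is the task of Problem 6.10); it can only fail to find a profitable deviation.

```python
def winner(bids):
    """Index of the winning bidder: highest bid, ties go to the lowest index."""
    return bids.index(max(bids))


def first_price_payoff(i, bids, vals):
    """Payoff to bidder i: valuation minus own bid if i wins, otherwise 0."""
    return vals[i] - bids[i] if winner(bids) == i else 0.0


def is_nash_on_grid(bids, vals, grid):
    """True if no bidder gains by switching unilaterally to a bid in `grid`."""
    for i in range(len(bids)):
        current = first_price_payoff(i, bids, vals)
        for b in grid:
            deviation = list(bids)
            deviation[i] = b
            if first_price_payoff(i, deviation, vals) > current + 1e-12:
                return False
    return True


# Hypothetical valuations, ordered so that v_1 >= v_2 >= ... >= v_n.
vals = [10.0, 8.0, 5.0, 3.0]
grid = [k / 4 for k in range(49)]            # candidate bids 0, 0.25, ..., 12

candidate = [vals[1]] + vals[1:]             # the profile (v_2, v_2, v_3, ..., v_n)
truthful = list(vals)                        # everyone bids the true valuation

print(is_nash_on_grid(candidate, vals, grid))
print(is_nash_on_grid(truthful, vals, grid))
```

With these sample numbers the first call finds no profitable deviation on the grid, whereas truthful bidding fails the check: bidder 1 can lower his bid to just above 8 and still win.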

This  game  has  many  Nash  equilibria.  In  each  of  these  equilibria,  however,  a 
player  with  a  highest  valuation  obtains  the  object.  Bidding  one’s  true  valuation  as 
well  as  bidding  higher  than  one’s  true  valuation  are  weakly  dominated  strategies. 
Bidding  lower  than  one’s  true  valuation  is  not  weakly  dominated.  (See  Sect.  6.1 
for  the  definition  of  weak  domination.)  Problem  6.10  is  about  proving  all  these 
statements. 

A second-price sealed-bid auction differs from a first-price sealed-bid auction
only in that the winner now pays the bid of the second highest bidder. If two or more
players submit the highest bid, the player with the lowest number wins and pays his own
bid. The main property of this auction is that for each player $i$, the strategy of bidding
the true valuation $v_i$ weakly dominates all other strategies. This property and other
properties are collected in Problem 6.11.
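The weak dominance of truthful bidding in the second-price auction can be illustrated by brute force on small grids. The following sketch makes several assumptions that are not in the text: the helper names, the valuation $v_i = 2$, three bidders, and the two grids. The opponents' grid is deliberately finer than the own-bid grid, so that between any two own bids there is an opposing bid that separates them. This is a numerical illustration, not a proof (see Problem 6.11).

```python
from itertools import product


def second_price_payoff(i, bids, vals):
    """Payoff to bidder i in a second-price auction with ties won by the lowest index."""
    if bids.index(max(bids)) != i:
        return 0.0
    others = bids[:i] + bids[i + 1:]
    # Price is the highest competing bid; in a tie this equals the winner's own bid.
    return vals[i] - max(others)


def truthful_weakly_dominates(i, v, n, own_grid, opp_grid):
    """Check on finite grids that bidding v weakly dominates every other bid in own_grid."""
    vals = [0.0] * n                 # only entry i matters for bidder i's payoff
    vals[i] = v
    for b in own_grid:
        if b == v:
            continue
        strictly_better_somewhere = False
        for others in product(opp_grid, repeat=n - 1):
            truthful = list(others[:i]) + [v] + list(others[i:])
            alternative = list(others[:i]) + [b] + list(others[i:])
            u_truth = second_price_payoff(i, truthful, vals)
            u_alt = second_price_payoff(i, alternative, vals)
            if u_truth < u_alt:
                return False
            if u_truth > u_alt:
                strictly_better_somewhere = True
        if not strictly_better_somewhere:
            return False
    return True


own_grid = [float(k) for k in range(5)]      # own candidate bids 0, 1, ..., 4
opp_grid = [k / 2 for k in range(9)]         # opponents' bids 0, 0.5, ..., 4
print(all(truthful_weakly_dominates(i, 2.0, 3, own_grid, opp_grid) for i in range(3)))
```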


4 Also here the assumption is that the players know the game. This means, in particular, that the
players know each other’s valuations.

6.5.2 Incomplete Information

We  consider  the  same  setting  as  in  Sect.  6.5.1  but  now  assume  that  each  bidder 
knows  his  own  valuation  but  has  only  a  probabilistic  estimate  about  the  valuations 
of  the  other  bidders.  In  the  terminology  of  types  (cf.  Sect.  5.1),  a  bidder’s  valuation 
is  his  true  type,  and  each  bidder  holds  a  probability  distribution  over  the  type 
combinations  of  the  other  bidders.  To  keep  things  simple,  we  assume  that  every 
bidder’s  type  is  drawn  independently  from  the  uniform  distribution  over  the  interval 
[0,  1],  that  this  is  common  knowledge,  and  that  each  bidder  learns  his  true  type. 
The  auction  is  a  first-price  sealed-bid  auction.  Of  course,  we  can  no  longer  fix  the 
ordering  of  the  valuations,  but  we  can  still  employ  the  same  tie-breaking  rule  in  case 
of  more  than  one  highest  bid. 

We discuss the case of two bidders and postpone the extension to $n > 2$ bidders
until Problem 6.13. In the associated two-person game, a strategy of player $i \in \{1, 2\}$
should assign a bid to each of his possible types. Since the set of possible types is
the interval $[0, 1]$ and it does not make sense to ever bid more than 1, a strategy is
a function $s_i : [0, 1] \to [0, 1]$. Hence, if player $i$’s type is $v_i$, then $b_i = s_i(v_i)$ is his
bid according to the strategy $s_i$. The payoff function $u_i$ of player $i$ assigns to each
strategy pair $(s_i, s_j)$ (where $j$ is the other player) player $i$’s expected payoff if these
strategies are played. In a Nash equilibrium of the game, player $i$ maximizes this
payoff function given the strategy of player $j$, and vice versa. For this, it is sufficient
that each type of player $i$ maximizes its expected payoff given the strategy of player
$j$, and vice versa; in other words (cf. Chap. 5), it is sufficient that the strategies form
a Bayesian equilibrium (see footnote 5).

We claim that $s_1^*(v_1) = v_1/2$ and $s_2^*(v_2) = v_2/2$ is a Nash equilibrium of this
game. To prove this, first consider type $v_1$ of player 1 and suppose that player 2 plays
strategy $s_2^*$. If player 1 bids $b_1$, then the probability that player 1 wins the auction is
equal to the probability that the bid of player 2 is smaller than or equal to $b_1$. This
probability is equal to the probability that $v_2/2$ is smaller than or equal to $b_1$, i.e.,
to the probability that $v_2$ is smaller than or equal to $2b_1$. We may assume without
loss of generality that $b_1 \le 1/2$, since according to $s_2^*$ player 2 will never bid higher
than $1/2$. Since $v_2$ is uniformly distributed over the interval $[0, 1]$ and $2b_1 \le 1$, the
probability that $v_2$ is smaller than or equal to $2b_1$ is just equal to $2b_1$. Hence, the
probability that the bid $b_1$ of player 1 is winning is equal to $2b_1$ if player 2 plays $s_2^*$,
and therefore the expected payoff from this bid is equal to $2b_1(v_1 - b_1)$ (if player
1 loses his payoff is zero). This expression is a concave quadratic in $b_1$ whose derivative
$2v_1 - 4b_1$ vanishes at $b_1 = v_1/2$, so it is maximal for $b_1 = v_1/2$. Hence, $s_1^*(v_1) = v_1/2$
is a best reply to $s_2^*$. The converse is almost analogous. The only difference is
that for player 2 to win, player 1’s bid must be strictly smaller due to the tie-breaking
rule employed, but this does not change the associated probability under the uniform
distribution. Hence, we have proved the claim.
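The best-reply step of this proof can be checked with a small grid search. This is a minimal sketch, assuming the opponent bids half his valuation as in the claim; the function names and the grid resolution are hypothetical.

```python
def win_probability(b1):
    """Prob. that bid b1 wins when the opponent bids v2/2 with v2 uniform on [0, 1]."""
    return min(max(2.0 * b1, 0.0), 1.0)


def expected_payoff(v1, b1):
    """Expected payoff of type v1 from bidding b1 against the strategy v2 -> v2/2."""
    return win_probability(b1) * (v1 - b1)


def best_bid(v1, steps=1000):
    """Grid search for the payoff-maximizing bid in [0, 1]."""
    grid = [k / steps for k in range(steps + 1)]
    return max(grid, key=lambda b: expected_payoff(v1, b))


for v1 in (0.2, 0.6, 1.0):
    print(v1, best_bid(v1))
```

For each sample type the search returns a bid equal (up to the grid resolution) to half the valuation, in line with the derivation.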


5 The difference is that, for instance, single types may not play a best reply in a Nash equilibrium
since they have probability zero and therefore do not influence the expected payoffs.

Thus, in this equilibrium, each bidder bids half his true valuation, and a player
with the highest valuation wins the auction.

How about the second-price sealed-bid auction with incomplete information?
This is more straightforward since bidding one’s true valuation ($s_i(v_i) = v_i$ for all
$v_i \in [0, 1]$) is a strategy that weakly dominates every other strategy, for each player $i$.
Hence, these strategies still form a (Bayesian) Nash equilibrium. See Problem 6.11.


6.5.3  Incomplete  Information:  A  Double  Auction 


Assume there are two players, a buyer and a seller. The seller owns an object, for
which he has a valuation $v_s$. The buyer has a valuation $v_b$ for this object. These
valuations are independently drawn from the uniform distribution over $[0, 1]$. Each
player learns his own valuation but not the valuation of the other player; he only
knows that it is drawn from the uniform distribution over $[0, 1]$.

The auction works as follows. The buyer and the seller independently and
simultaneously mention prices $p_b$ and $p_s$. If $p_b \ge p_s$, then trade takes place at the
average price $p = (p_b + p_s)/2$, and the payoffs are $v_b - p$ to the buyer and $p - v_s$ to
the seller. If $p_b < p_s$ then no trade takes place and both have payoff 0.

This is a game of incomplete information. The buyer has infinitely many types
$v_b \in [0, 1]$, and the seller has infinitely many types $v_s \in [0, 1]$. A strategy assigns a
price to each type. A strategy for the buyer is therefore a function $p_b : [0, 1] \to [0, 1]$,
where $p_b(v_b)$ is the price that the buyer offers if his type (valuation) is $v_b$. (Observe
that we may indeed assume that the price is never higher than 1.) Similarly, a strategy
for the seller is a function $p_s : [0, 1] \to [0, 1]$, where $p_s(v_s)$ is the price that the seller
asks if his type (valuation) is $v_s$.

Suppose the seller plays a strategy $p_s(\cdot)$. Then the expected payoff to the buyer if
his valuation is $v_b$ and he offers price $p_b$ is equal to

$$\left[ v_b - \frac{p_b + E[p_s(v_s) \mid p_b \ge p_s(v_s)]}{2} \right] \cdot \mathrm{Prob}[p_b \ge p_s(v_s)] \qquad (6.6)$$

where $E[p_s(v_s) \mid p_b \ge p_s(v_s)]$ denotes the expected price asked by the seller according
to his strategy $p_s(\cdot)$, conditional on this price being smaller than or equal to the price
$p_b$ of the buyer.
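Expression (6.6) can be evaluated numerically for any given seller strategy. The sketch below is an illustration only: the function names are hypothetical, the integral over $v_s$ is approximated by a midpoint rule, and the seller strategy $p_s(v_s) = v_s$ is an assumed test input chosen because it gives a simple closed form. It is not claimed to be an equilibrium strategy. With that input, $\mathrm{Prob}[p_b \ge p_s(v_s)] = p_b$ and the conditional expectation equals $p_b/2$.

```python
def trade_payoffs(v_b, v_s, p_b, p_s):
    """Ex-post payoffs (buyer, seller): trade at the average price iff p_b >= p_s."""
    if p_b >= p_s:
        price = (p_b + p_s) / 2
        return v_b - price, price - v_s
    return 0.0, 0.0


def buyer_expected_payoff(v_b, p_b, seller_strategy, n=10_000):
    """Midpoint-rule approximation of expression (6.6) for v_s uniform on [0, 1]."""
    total = 0.0
    for k in range(n):
        v_s = (k + 0.5) / n
        total += trade_payoffs(v_b, v_s, p_b, seller_strategy(v_s))[0]
    return total / n


def truthful_seller(v_s):
    """Hypothetical seller strategy p_s(v_s) = v_s, used only as a test input."""
    return v_s


v_b, p_b = 0.9, 0.6
# Closed form of (6.6) for this input: [v_b - (p_b + p_b/2)/2] * p_b.
closed_form = (v_b - (p_b + p_b / 2) / 2) * p_b
print(buyer_expected_payoff(v_b, p_b, truthful_seller), closed_form)
```

Averaging the ex-post payoff over $v_s$ gives the same number as (6.6), because the payoff is zero whenever no trade takes place.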

Similarly, suppose the buyer plays a strategy $p_b(\cdot)$. Then the expected payoff to
the seller if his valuation is $v_s$ and he asks price $p_s$ is equal to

$$\left[ \frac{p_s + E[p_b(v_b) \mid p_s \le p_b(v_b)]}{2} - v_s \right] \cdot \mathrm{Prob}[p_s \le p_b(v_b)] \qquad (6.7)$$

where $E[p_b(v_b) \mid p_s \le p_b(v_b)]$ denotes the expected price offered by the buyer
according to his strategy $p_b(\cdot)$, conditional on this price being larger than or equal
to the price $p_s$ of the seller.


Now the pair of strategies $(p_b(\cdot), p_s(\cdot))$ is a (Bayesian) Nash equilibrium if for
each $v_b \in [0, 1]$, $p_b(v_b)$ solves

$$\max_{p_b \in [0,1]} \left[ v_b - \frac{p_b + E[p_s(v_s) \mid p_b \ge p_s(v_s)]}{2} \right] \cdot \mathrm{Prob}[p_b \ge p_s(v_s)] \qquad (6.8)$$

and for each $v_s \in [0, 1]$, $p_s(v_s)$ solves

$$\max_{p_s \in [0,1]} \left[ \frac{p_s + E[p_b(v_b) \mid p_s \le p_b(v_b)]}{2} - v_s \right] \cdot \mathrm{Prob}[p_s \le p_b(v_b)] \qquad (6.9)$$

This game has many Nash equilibria. Ideally, trade should take place whenever it
is efficient, i.e., whenever $v_b \ge v_s$. In Problem 6.14 we will see that not all Nash
equilibria are equally efficient.


6.6  Mixed  Strategies  and  Incomplete  Information 

In  this  section  we  discuss  how  a  mixed  strategy  Nash  equilibrium  in  a  bimatrix  game 
can  be  obtained  as  a  limit  of  pure  strategy  Bayesian  Nash  equilibria  in  associated 
games  of  incomplete  information. 

Consider  the  bimatrix  game  (cf.  Chap.  3) 

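$$
\begin{array}{cc}
 & \begin{array}{cc} L & R \end{array} \\
\begin{array}{c} T \\ B \end{array} &
\left( \begin{array}{cc} 2,1 & 2,0 \\ 3,0 & 1,3 \end{array} \right),
\end{array}
$$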

which has a unique Nash equilibrium $((p^*, 1 - p^*), (q^*, 1 - q^*))$ with $p^* = 3/4$
and $q^* = 1/2$. The interpretation of mixed strategies and of a mixed strategy
Nash equilibrium in particular is an old issue in the game-theoretic literature. One
obvious interpretation is that a player actually plays according to the equilibrium
probabilities. Although there is some empirical evidence that this may occur in
reality (see footnote 6), this interpretation may not be entirely convincing, in particular
since in a mixed strategy Nash equilibrium a player is indifferent between all pure strategies
played with positive probability (cf. Problem 3.8). An alternative interpretation,
also mentioned in Sect. 3.1, is that a mixed strategy of a player represents the
belief(s) of the other player(s) about the strategic choice of that player. For instance,
in the above equilibrium, player 2 believes that player 1 plays $T$ with probability $3/4$.
The drawback of this interpretation is that these beliefs are subjective, and it is not
explained how they are formed. In this section we discuss a way to obtain a mixed
strategy Nash equilibrium as the limit of pure strategy (Bayesian) Nash equilibria
in games obtained by adding some objective uncertainty about the payoffs. In this
way, the strategic uncertainty of players as expressed by their beliefs is replaced by
the objective uncertainty of a chance move.

6 For example, Walker and Wooders (2001).
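The equilibrium probabilities $p^* = 3/4$ and $q^* = 1/2$ of the bimatrix game follow from the usual indifference conditions, which can be sketched in a few lines. The helper `mixed_equilibrium_2x2` is a hypothetical name, and it assumes that the game has a completely mixed equilibrium, so that both denominators are nonzero.

```python
from fractions import Fraction


def mixed_equilibrium_2x2(A, B):
    """Completely mixed equilibrium (p, q) of a 2x2 bimatrix game via indifference.

    A[r][c] and B[r][c] are the payoffs to players 1 and 2 when row r and
    column c are played; p is the probability of row 0, q of column 0.
    Assumes such an equilibrium exists (both denominators are nonzero).
    """
    # Player 2 is indifferent between the columns: p*B00 + (1-p)*B10 = p*B01 + (1-p)*B11.
    p = Fraction(B[1][1] - B[1][0], B[0][0] - B[1][0] - B[0][1] + B[1][1])
    # Player 1 is indifferent between the rows: q*A00 + (1-q)*A01 = q*A10 + (1-q)*A11.
    q = Fraction(A[1][1] - A[0][1], A[0][0] - A[0][1] - A[1][0] + A[1][1])
    return p, q


A = [[2, 2], [3, 1]]   # payoffs to player 1 (rows T, B; columns L, R)
B = [[1, 0], [0, 3]]   # payoffs to player 2
print(mixed_equilibrium_2x2(A, B))   # (Fraction(3, 4), Fraction(1, 2))
```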

In the above example, suppose that the payoff to player 1 from $(T, L)$ is the
uncertain amount $2 + \alpha$ and the payoff to player 2 from $(B, R)$ is the uncertain
amount $3 + \beta$. Assume that both $\alpha$ and $\beta$ are (independently) drawn from a uniform
distribution over the interval $[0, x]$, where $x > 0$. Moreover, player 1 learns the true
value of $\alpha$ and player 2 learns the true value of $\beta$, and all this is common knowledge
among the players. In terms of types, player 1 knows his type $\alpha$ and player 2 knows
his type $\beta$. The new payoffs are given by

$$
\begin{array}{cc}
 & \begin{array}{cc} L & R \end{array} \\
\begin{array}{c} T \\ B \end{array} &
\left( \begin{array}{cc} 2 + \alpha, 1 & 2, 0 \\ 3, 0 & 1, 3 + \beta \end{array} \right)
\end{array}
$$

A (pure) strategy of a player assigns an action to each of his types. Hence, for player
1 it is a map $s_1 : [0, x] \to \{T, B\}$ and for player 2 it is a map $s_2 : [0, x] \to \{L, R\}$.

To find an equilibrium of this incomplete information game, suppose that player 2
has the following rather simple strategy: play $L$ if $\beta$ is small and play $R$ if $\beta$ is large.
Specifically, let $b \in [0, x]$ such that each type $\beta \le b$ plays $L$ and each type $\beta > b$
plays $R$. Call this strategy $s_2^b$. What is player 1’s best reply against $s_2^b$? Suppose the
type of player 1 is $\alpha$. If player 1 plays $T$, then his expected payoff is equal to $2 + \alpha$
times the probability that player 2 plays $L$ plus 2 times the probability that player 2
plays $R$. The probability that player 2 plays $L$, given the strategy $s_2^b$, is equal to the
probability that $\beta$ is at most equal to $b$, and this is equal to $b/x$ since $\beta$ is uniformly
distributed over $[0, x]$. Hence, the expected payoff to player 1 from playing $T$ is

$$(2 + \alpha) \cdot \frac{b}{x} + 2 \cdot \left(1 - \frac{b}{x}\right) = 2 + \alpha \cdot \frac{b}{x}.$$

Similarly, the expected payoff to player 1 from playing $B$ is

$$3 \cdot \frac{b}{x} + 1 \cdot \left(1 - \frac{b}{x}\right) = 1 + 2 \cdot \frac{b}{x}.$$

From this, it easily follows that $T$ is at least as good as $B$ if $\alpha \ge (2b - x)/b$. Hence,
the following strategy of player 1 is a best reply against strategy $s_2^b$ of player 2: play
$T$ if $\alpha \ge a$ and play $B$ if $\alpha < a$, where $a = (2b - x)/b$. Call this strategy $s_1^a$.

For the converse, assume that player 1 plays $s_1^a$. To find player 2’s best reply
against $s_1^a$ we proceed similarly as above. If type $\beta$ of player 2 plays $L$ then the
expected payoff is 1 times the probability that player 1 plays $T$, hence 1 times
$(x - a)/x$. If type $\beta$ of player 2 plays $R$ then his expected payoff is equal to $3 + \beta$ times
the probability that player 1 plays $B$, hence $(3 + \beta)a/x$. So $L$ is at least as good as $R$
if $\beta \le (x - 4a)/a$. Hence, a best reply of player 2 against $s_1^a$ is the strategy $s_2^b$ with
$b = (x - 4a)/a$.


Summarizing these arguments, we have that $(s_1^a, s_2^b)$ is a (Bayesian) Nash
equilibrium for

$$a = (2b - x)/b, \qquad b = (x - 4a)/a.$$

Solving these two equations simultaneously for solutions $a, b \in [0, x]$ yields:

$$a = \tfrac{1}{4}\left(x + 4 - \sqrt{x^2 + 16}\right), \qquad b = \tfrac{1}{2}\left(x - 4 + \sqrt{x^2 + 16}\right).$$

In this equilibrium, the a priori probability that player 1 will play $T$, that is, the
probability of playing $T$ before he learns his type, is equal to $(x - a)/x$, hence to
$(\sqrt{x^2 + 16} + 3x - 4)/(4x)$. Similarly, the a priori probability that player 2 plays $L$ is
equal to $b/x$, hence to $(x - 4 + \sqrt{x^2 + 16})/(2x)$. What happens with these probabilities
as the amount of uncertainty decreases, i.e., for $x$ approaching 0? For player 1,

$$\lim_{x \to 0} \frac{\sqrt{x^2 + 16} + 3x - 4}{4x} = \lim_{x \to 0} \frac{x/\sqrt{x^2 + 16} + 3}{4} = \frac{3}{4},$$

where the first equality follows from l’Hôpital’s rule. Similarly for player 2:

$$\lim_{x \to 0} \frac{x - 4 + \sqrt{x^2 + 16}}{2x} = \lim_{x \to 0} \frac{1 + x/\sqrt{x^2 + 16}}{2} = \frac{1}{2}.$$

In other words, these probabilities converge to the mixed strategy Nash equilibrium
of the original game.
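The closed-form cut-offs and the two limits can be checked numerically. The sketch below uses hypothetical function names and a few sample values of $x$; it verifies that the cut-offs solve the two best-reply equations and shows the a priori probabilities approaching $3/4$ and $1/2$ as $x$ shrinks.

```python
import math


def thresholds(x):
    """Equilibrium cut-offs (a, b) from the closed-form solution for a given x > 0."""
    root = math.sqrt(x * x + 16.0)
    a = (x + 4.0 - root) / 4.0
    b = (x - 4.0 + root) / 2.0
    return a, b


def prior_probabilities(x):
    """A priori probabilities that player 1 plays T and that player 2 plays L."""
    a, b = thresholds(x)
    return (x - a) / x, b / x


for x in (1.0, 0.1, 0.001):
    a, b = thresholds(x)
    # Both residuals should vanish if (a, b) solves a = (2b - x)/b and b = (x - 4a)/a.
    residuals = (a - (2 * b - x) / b, b - (x - 4 * a) / a)
    print(x, prior_probabilities(x), residuals)
```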


6.7  Sequential  Bargaining 

In  its  simplest  version,  the  bargaining  problem  involves  two  parties  who  have 
to  agree  on  one  alternative  within  a  set  of  feasible  alternatives.  If  they  fail  to 
reach  an  agreement,  a  specific  ‘disagreement’  alternative  is  implemented.  In  the 
game-theoretic  literature  on  bargaining  there  are  two  main  strands,  namely  the 
cooperative,  axiomatic  approach,  also  known  as  the  Nash  bargaining  problem,  and 
the  noncooperative,  strategic  approach,  with  the  Rubinstein  alternating  offers  game 
as  main  representative.  In  this  section  the  focus  is  on  the  strategic  approach,  but  in 
Sect.  6.7.2  we  also  mention  the  connection  with  the  Nash  bargaining  solution.  For  an 
introduction  to  the  axiomatic  bargaining  approach  see  Sect.  10.1,  and  see  Chap.  21 
for a more extensive study.

6.7.1 Finite Horizon Bargaining

Consider the example in Sect. 1.3.5, where two players bargain over the division of
one unit of a perfectly divisible good, e.g., 1 liter of wine. If they do not reach an
agreement, we assume that no one gets anything. To keep the problem as simple
as possible, assume that the preferences of the players are represented by
$u_1(\alpha) = u_2(\alpha) = \alpha$ for every $\alpha \in [0, 1]$. That is, obtaining an amount $\alpha$ of the good has
utility $\alpha$ for each player. In this case the picture of the feasible set in Sect. 1.3.5
would be a triangle.

To model the bargaining process we consider the following alternating offers
procedure. There are $T + 1$ rounds, where $T \in \mathbb{N}$. In round $t = 0$ player 1
makes a proposal, say $(\alpha, 1 - \alpha)$, where $\alpha \in [0, 1]$, meaning that he claims an
amount $\alpha$ for himself, so that player 2 obtains $1 - \alpha$. Player 2 can either accept this
proposal, implying that the proposal is implemented and the game is over, or reject
the proposal. In the latter case the next round $t = 1$ starts, and the first round is
repeated with the roles of the players interchanged: player 2 makes the proposal and
player 1 accepts or rejects it. If player 1 accepts the proposal then it is implemented
and the game is over; if player 1 rejects the proposal then round $t = 2$ starts, and the
roles of the players are interchanged again. Thus, at even rounds, player 1 proposes;
at odd rounds, player 2 proposes. The last possible round is round $T$: if this round is
reached, then the disagreement alternative $(0, 0)$ is implemented.

We assume that utilities are discounted. Specifically, there is a discount factor
$0 < \delta < 1$ such that receiving an amount $\alpha$ at round $t$ has utility $\delta^t \alpha$ at round 0. Or,
receiving an amount $\alpha$ at round $t$ has utility $\delta \alpha$ at round $t - 1$. This reflects the fact
that receiving the same amount earlier is more valuable.

In Fig. 6.5 this bargaining procedure is represented as a game in extensive form.
Here, we assume that $T$ is odd, so that the last proposal at time $t = T - 1$ is made
by player 1.

We look for a subgame perfect equilibrium of this game, which can be found
by backward induction. Note that subgames start at each decision node of a player,
and that at each such node in Fig. 6.5 where a player has to make, accept or reject
a proposal, actually infinitely many subgames start, since there are infinitely many
possible paths leading to that node.

[Figure 6.5: game tree with rounds $t = 0, 1, 2, \dots, T - 1, T$. Player 1 proposes at even rounds and player 2 at odd rounds; acceptance at round $t$ yields $\delta^t(\alpha, 1 - \alpha)$, rejection leads to the next round, and round $T$ yields $(0, 0)$.]

Fig. 6.5 The extensive form representation of the finite horizon bargaining procedure. The
number of rounds $T + 1$ is even, $\alpha$ denotes the proposed amount for player 1, $A$ is acceptance
and $R$ is rejection

[Figure 6.6: the game tree of Fig. 6.5 with rounds $t = 0, 1, 2, \dots, T - 2$ retained and the part from round $T - 1$ on replaced by a terminal payoff pair.]

Fig. 6.6 The game of Fig. 6.5 reduced by replacing rounds $T - 1$ and $T$ by the equilibrium payoffs
of the associated subgames

To start the analysis, at the final decision node, player 2 accepts if $\alpha < 1$ and
is indifferent between acceptance and rejection if $\alpha = 1$: this is because rejection
results in getting 0 at round $T$. In the subgame starting at round $T - 1$ with a proposal
of player 1, the only equilibrium therefore is for player 1 to propose $\alpha = 1$ and for
player 2 to accept any proposal: if player 2 would reject $\alpha = 1$ then player 1 could
improve by proposing $0 < \alpha < 1$; hence, we only have an equilibrium if player 2
accepts $\alpha = 1$ and, clearly, proposing $\alpha = 1$ is optimal for player 1. Hence, we can
replace the part of the game from round $T - 1$ on by the pair of payoffs $(\delta^{T-1}, 0)$, as
in Fig. 6.6.

In this reduced game, in the subgame starting at round $T - 2$, player 1 can obtain
$\delta^{T-1}$ by rejecting player 2’s proposal. Recall that this is the discounted utility at time
$t = 0$ of receiving an amount 1 at time $T - 1$; the discounted utility of the amount
1 at time $T - 2$ is equal to $\delta$. Hence, in a backward induction equilibrium player
2 proposes $\alpha = \delta$ and player 1 accepts this proposal or any higher $\alpha$ and rejects
any lower $\alpha$. Hence, we can replace this whole subgame by the pair of payoffs
$(\delta^{T-2}\delta, \delta^{T-2}(1 - \delta)) = (\delta^{T-1}, \delta^{T-2}(1 - \delta))$. Continuing this line of reasoning, in the
subgame starting at round $T - 3$, player 1 proposes $\alpha = 1 - \delta(1 - \delta)$, which will be
accepted by player 2. This results in the payoffs $(\delta^{T-3}(1 - \delta(1 - \delta)), \delta^{T-2}(1 - \delta))$.
This can be written as $(\delta^{T-3}(1 - \delta + \delta^2), \delta^{T-3}(\delta - \delta^2))$. And so on and so forth.
The general principle is that each player offers the other player a share equal to $\delta$
times the share the other player can expect in the next round.

By backtracking all the way to round 0 (see Table 6.1), we find that player 1
proposes $1 - \delta + \delta^2 - \dots + \delta^{T-1}$ and player 2 accepts this proposal, resulting in
the payoffs $1 - \delta + \delta^2 - \dots + \delta^{T-1}$ for player 1 and $\delta - \delta^2 + \dots - \delta^{T-1}$ for player
2. This is the subgame perfect equilibrium outcome of the game and the associated
payoffs. This outcome is the path of play, induced by the following subgame perfect
equilibrium:

• At even rounds $t$, player 1 proposes $\alpha = 1 - \delta + \dots + \delta^{T-1-t}$ and player 2 accepts
  this proposal or any smaller $\alpha$, and rejects any larger $\alpha$.

• At odd rounds $t$, player 2 proposes $\alpha = \delta - \delta^2 + \dots + \delta^{T-1-t}$ and player 1 accepts
  this proposal or any larger $\alpha$, and rejects any smaller $\alpha$.

Table 6.1 The proposals made in the subgame perfect equilibrium

Round     Proposer   Share for player 1                        Share for player 2
$T$       -          $0$                                       $0$
$T-1$     1          $1$                                       $0$
$T-2$     2          $\delta$                                  $1-\delta$
$T-3$     1          $1-\delta+\delta^2$                       $\delta-\delta^2$
$T-4$     2          $\delta-\delta^2+\delta^3$                $1-\delta+\delta^2-\delta^3$
...       ...        ...                                       ...
$0$       1          $1-\delta+\delta^2-\dots+\delta^{T-1}$    $\delta-\delta^2+\dots-\delta^{T-1}$

In  Problem  6.16  some  variations  on  this  finite  horizon  bargaining  game  are 
discussed. 
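The backward induction for the finite horizon game can be sketched as a short recursion. This is an illustration under stated assumptions: the function name `spe_shares`, the horizon $T = 5$, and the discount factor $\delta = 1/2$ are hypothetical choices, and exact fractions are used so that the result can be compared with the alternating sum $1 - \delta + \delta^2 - \dots + \delta^{T-1}$ without rounding.

```python
from fractions import Fraction


def spe_shares(T, delta):
    """Backward induction: shares[t] = (share of player 1, share of player 2) at round t.

    Round T is disagreement (0, 0). At every earlier round the proposer
    (player 1 at even t, player 2 at odd t) offers the responder delta times
    the share the responder would obtain in round t + 1 and keeps the rest.
    """
    shares = {T: (Fraction(0), Fraction(0))}
    for t in range(T - 1, -1, -1):
        proposer = t % 2              # index 0 is player 1, index 1 is player 2
        responder = 1 - proposer
        offer = delta * shares[t + 1][responder]
        split = [Fraction(0), Fraction(0)]
        split[responder] = offer
        split[proposer] = 1 - offer
        shares[t] = tuple(split)
    return shares


T, delta = 5, Fraction(1, 2)          # hypothetical values; T is odd as in the text
shares = spe_shares(T, delta)
for t in range(T, -1, -1):
    print(t, shares[t])

alternating_sum = sum((-delta) ** k for k in range(T))   # 1 - delta + ... + delta^(T-1)
print(shares[0][0] == alternating_sum)
```

The shares are undiscounted amounts at the round in which agreement is reached, matching the rows of Table 6.1 read from round $T$ down to round 0.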


6.7.2  Infinite  Horizon  Bargaining 

In this subsection we consider the same bargaining problem as in the previous
subsection, but now we assume $T = \infty$: the number of rounds may potentially
be infinite. If no agreement is ever reached, then no player obtains anything. This
game, like the finite horizon game, has many Nash equilibria: see Problem 6.16(f).

One  way  to  analyze  the  game  is  to  consider  the  finite  horizon  case  and  take  the 
limit  as  T  approaches  infinity:  see  Problem  6.16(e).  In  fact,  the  resulting  distribution 
is  the  uniquely  possible  outcome  of  a  subgame  perfect  equilibrium,  as  can  be  seen 
by  comparing  the  answer  to  Problem  6.16(e)  with  the  result  presented  below.  Of 
course,  this  claim  is  not  proved  by  just  taking  the  limit. 

Note  that  a  subgame  perfect  equilibrium  cannot  be  obtained  by  backward 
induction,  since  the  game  has  no  final  decision  nodes.  Here,  we  will  just  describe  a 
pair  of  strategies  and  show  that  they  are  a  subgame  perfect  equilibrium  of  the  game. 
A  proof  that  the  associated  outcome  is  the  unique  outcome  resulting  in  any  subgame 
perfect  equilibrium  can  be  found  in  the  literature. 

Let $x^* = (x_1^*, x_2^*)$ and $y^* = (y_1^*, y_2^*)$ be such that $x_1^*, x_2^*, y_1^*, y_2^* \ge 0$,
$x_1^* + x_2^* = y_1^* + y_2^* = 1$, and moreover

$$x_2^* = \delta y_2^*, \qquad y_1^* = \delta x_1^*. \qquad (6.10)$$

It is not difficult to verify that $x^* = (1/(1 + \delta), \delta/(1 + \delta))$ and
$y^* = (\delta/(1 + \delta), 1/(1 + \delta))$. Consider the following strategies for players 1 and 2, respectively:

($\sigma_1^*$) At $t = 0, 2, 4, \dots$ propose $x^*$; at $t = 1, 3, 5, \dots$ accept a proposal $z = (z_1, z_2)$
of player 2 if and only if $z_1 \ge \delta x_1^*$.

($\sigma_2^*$) At $t = 1, 3, 5, \dots$ propose $y^*$; at $t = 0, 2, 4, \dots$ accept a proposal $z = (z_1, z_2)$
of player 1 if and only if $z_2 \ge \delta y_2^*$.
These strategies are stationary: the players always make the same proposals.
Moreover, a player accepts any proposal that offers him at least the discounted value
of his own demand. According to (6.10), player 2 accepts the proposal $x^*$ and player
1 accepts the proposal $y^*$. Hence, play of the strategy pair $(\sigma_1^*, \sigma_2^*)$ results in player
1’s proposal $x^* = (1/(1 + \delta), \delta/(1 + \delta))$ being accepted at round 0, so that these
are also the payoffs. We will show that $(\sigma_1^*, \sigma_2^*)$ is a subgame perfect equilibrium of
the game.
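The stationary proposals and acceptance rules are easy to check for a concrete discount factor. The following sketch uses hypothetical function names and the assumed value $\delta = 9/10$; it verifies the two equations of (6.10) and that each player accepts the other's equilibrium proposal.

```python
from fractions import Fraction


def stationary_proposals(delta):
    """Proposals x* (by player 1) and y* (by player 2) solving system (6.10)."""
    x_star = (1 / (1 + delta), delta / (1 + delta))
    y_star = (delta / (1 + delta), 1 / (1 + delta))
    return x_star, y_star


def player1_accepts(z, delta):
    """Acceptance rule of player 1: z_1 must be at least delta * x_1*."""
    x_star, _ = stationary_proposals(delta)
    return z[0] >= delta * x_star[0]


def player2_accepts(z, delta):
    """Acceptance rule of player 2: z_2 must be at least delta * y_2*."""
    _, y_star = stationary_proposals(delta)
    return z[1] >= delta * y_star[1]


delta = Fraction(9, 10)               # hypothetical discount factor
x_star, y_star = stationary_proposals(delta)
print(x_star, y_star)
print(x_star[1] == delta * y_star[1], y_star[0] == delta * x_star[0])   # system (6.10)
print(player2_accepts(x_star, delta), player1_accepts(y_star, delta))
```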

To  show  this,  note  that  there  are  two  kinds  of  subgames:  subgames  where  a  player 
has  to  make  a  proposal;  and  subgames  where  a  proposal  is  on  the  table  and  a  player 
has  to  choose  between  accepting  and  rejecting  the  proposal. 

For the first kind of subgame we may without loss of generality consider the
entire game, i.e., the game starting at $t = 0$. We have to show that $(\sigma_1^*, \sigma_2^*)$ is a
Nash equilibrium in this game. First, suppose that player 1 plays $\sigma_1^*$. By accepting
player 1’s proposal at $t = 0$, player 2 has a payoff of $\delta/(1 + \delta)$. By rejecting this
proposal, the maximum player 2 can obtain against $\sigma_1^*$ is $\delta/(1 + \delta)$, by proposing
$y^*$ in round $t = 1$. Proposals $z$ with $z_2 > y_2^*$ and thus $z_1 < y_1^*$ are rejected by player
1. Hence, $\sigma_2^*$ is a best reply against $\sigma_1^*$. Similarly, if player 2 plays $\sigma_2^*$, then the best
player 1 can obtain is $x_1^*$ at $t = 0$ with payoff $1/(1 + \delta)$, since player 2 will reject
any proposal that gives player 1 more than this, and also does not offer more.

For the second kind of subgame, again we may without loss of generality take
$t = 0$ and assume that player 1 has made some proposal, say $z = (z_1, z_2)$. (The
argument for $t$ odd, when there is a proposal of player 2 on the table, is analogous.)
First, suppose that in this subgame player 1 plays $\sigma_1^*$. If $z_2 \ge \delta y_2^*$, then accepting
this proposal yields player 2 a payoff of $z_2 \ge \delta y_2^* = \delta/(1 + \delta)$. By rejecting, the
maximum player 2 can obtain against $\sigma_1^*$ is $\delta/(1 + \delta)$ by proposing $y^*$ at $t = 1$,
which will be accepted by player 1. If, on the other hand, $z_2 < \delta y_2^*$, then player 2
can indeed better reject $z$ and obtain $\delta/(1 + \delta)$ by proposing $y^*$ at $t = 1$. Hence,
$\sigma_2^*$ is a best reply against $\sigma_1^*$. Next, suppose player 2 plays $\sigma_2^*$. Then $z$ is accepted
if $z_2 \ge \delta y_2^*$ and rejected otherwise. In the first case it does not matter how player
1 replies, and in the second case the game starts again with player 2 as the first
proposer, and by an argument analogous to the argument in the previous paragraph,
player 1’s best reply is $\sigma_1^*$.

We have, thus, shown that $(\sigma_1^*, \sigma_2^*)$ is a subgame perfect equilibrium of the game.
In Problem 6.17 some variations on this game are discussed.

We  conclude  this  section  with  two  remarks. 

Remark 6.1 Nothing in the whole analysis changes if we view the number $\delta$ not
as a discount factor but as the probability that the game continues to the next
round. Specifically, if a proposal is rejected, then assume that with probability
$1 - \delta$ the game stops and each player receives 0, and with probability $\delta$ the game
continues in the usual way. Under this alternative interpretation, the game ends with
probability 1 [Problem 6.17(e)], which makes the infinite horizon assumption more
acceptable. □

Remark 6.2 In the subgame perfect equilibrium of the infinite horizon game the
shares are $1/(1 + \delta)$ for player 1 and $\delta/(1 + \delta)$ for player 2. For $\delta$ converging to
1, i.e., the players becoming more patient, these shares converge to $1/2$ for each,
which is the Nash bargaining outcome of the game, arising from maximizing the
product $\alpha(1 - \alpha)$ for $0 \le \alpha \le 1$ (cf. Sect. 1.3.5). This is true more generally, i.e.,
also if the utility functions are more general. See Sects. 10.1.2 and 21.4. □
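The convergence described in Remark 6.2 can be illustrated numerically. In this minimal sketch the function names, the sample discount factors, and the grid resolution are assumptions made for illustration: the equilibrium shares are evaluated for $\delta$ close to 1, and the maximizer of the product $\alpha(1 - \alpha)$ is found by a grid search.

```python
def spe_shares_infinite(delta):
    """Equilibrium shares (player 1, player 2) in the infinite horizon game."""
    return 1.0 / (1.0 + delta), delta / (1.0 + delta)


def nash_product_maximizer(steps=1000):
    """Grid search for the alpha in [0, 1] maximizing the product alpha * (1 - alpha)."""
    grid = [k / steps for k in range(steps + 1)]
    return max(grid, key=lambda alpha: alpha * (1.0 - alpha))


for delta in (0.5, 0.9, 0.99, 0.999):
    print(delta, spe_shares_infinite(delta))
print(nash_product_maximizer())
```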


6.8  Problems 

6.1.  Cournot  with  Asymmetric  Costs 

Consider the Cournot model of Sect. 6.2.1 but now assume that the firms have
different marginal costs $c_1, c_2 \ge 0$. Assume $0 \le c_1, c_2 < a$ and $a \ge 2c_1 - c_2$,
$a \ge 2c_2 - c_1$. Compute the Nash equilibrium.

6.2.  Cournot  Oligopoly 

Consider the Cournot model of Sect. 6.2.1 but now assume that there are $n$ firms
instead of two. Each firm $i = 1, \dots, n$ offers $q_i \ge 0$ and the market price is

$$P(q_1, \dots, q_n) = \max\{a - q_1 - \dots - q_n, 0\}.$$

Each firm still has marginal cost $c$ with $a > c \ge 0$ and no fixed costs.

(a)  Formulate  the  game  corresponding  to  this  situation.  In  particular,  write  down 
the  payoff  functions. 

(b)  Derive  the  reaction  functions  of  the  players. 

(c)  Derive  a  Nash  equilibrium  of  the  game  by  trying  equal  quantities  offered.  What 
happens  if  the  number  of  firms  becomes  large? 

(d)  Show  that  the  Nash  equilibrium  found  in  (c)  is  unique. 

6.3.  Quantity  Competition  with  Heterogenous  Goods 

Suppose,  in  the  Cournot  model,  that  the  firms  produce  heterogenous  goods,  which 
have  different  market  prices.  Specifically,  suppose  that  these  market  prices  are 
given  by: 


$$p_1 = \max\{5 - 2q_1 - 2q_2, 0\}, \qquad p_2 = \max\{4.5 - 1.5q_1 - 3q_2, 0\}.$$


The  firms  still  compete  in  quantities  and  have  equal  constant  marginal  costs  c,  with 
c  <  4.5. 

(a)  Formulate  the  game  corresponding  to  this  situation.  In  particular,  write  down 
the  payoff  functions. 

(b)  Solve  for  the  reaction  functions  and  the  Nash  equilibrium  of  this  game.  Also 
compute  the  corresponding  prices  and  profits. 

(c) Compute the quantities at which joint profit is maximized. Also compute the
corresponding  prices. 

In  (d)-(f),  we  assume  that  the  firms  compete  in  prices. 

(d) Derive the demands $q_1$ and $q_2$ as a function of the prices. Set up the associated
game where the prices $p_1$ and $p_2$ are now the strategic variables.

(e)  Solve  for  the  reaction  functions  and  the  Nash  equilibrium  of  this  game.  Also 
compute  the  corresponding  quantities  and  profits. 

(f)  Compute  the  prices  at  which  joint  profit  is  maximized.  Also  compute  the 
corresponding  quantities. 

(g)  Compare  the  results  found  under  (b)  and  (c)  with  those  under  (e)  and  (f). 

6.4.  A  Numerical  Example  of  Cournot  Competition  with  Incomplete  Information 
Redo the model of Sect. 6.2.2 for the following values of the parameters: $a = 1$,
$c = 0$, $\vartheta = 1/2$, $c_l = 0$, $c_h = 1/4$. Compute the Nash equilibrium and compare
with what was found in the text. Also compare with the complete information case
by using the answer to Problem 6.1.

6.5.  Cournot  Competition  with  Two-Sided  Incomplete  Information 

Consider the Cournot game of incomplete information of Sect. 6.2.2 and assume
that also firm 1 can have high costs or low costs, say $c_h$ with probability $\pi$ and
$c_l$ with probability $1 - \pi$. Set up the associated game and compute the (four)
reaction functions. (Assume that the parameters of the game are such that the Nash
equilibrium quantities are positive and the relevant parts of the reaction functions
can be found by differentiating the payoff functions, i.e., no corner solutions.) How
can the Nash equilibrium be computed? (You do not actually have to compute it
explicitly.)

6.6.  Incomplete  Information  About  Demand 

Consider the Cournot game of incomplete information of Sect. 6.2.2 but now assume
that the incomplete information is not about the cost of firm 2 but about market
demand. Specifically, assume that the number $a$ can be either high, $a_H$, with
probability $\vartheta$, or low, $a_L$, with probability $1 - \vartheta$. Firm 2 knows the value for
sure but firm 1 only knows these probabilities. Set up the game and compute the
reaction functions and the Nash equilibrium. (Make appropriate assumptions on the
parameters $a_H$, $a_L$, $\vartheta$, and $c$ to avoid corner solutions.)

6.7.  Variations  on  Two-Person  Bertrand 

(a) Assume that the two firms in the Bertrand model of Sect. 6.3 have different
marginal costs, say $c_1 < c_2 < a$. Derive the best reply functions and find the
Nash-Bertrand equilibrium or equilibria, if any.

(b) Reconsider the questions in (a) for the case where prices and costs are
restricted to integer values, i.e., $p_1, p_2, c_1, c_2 \in \{0, 1, 2, \ldots\}$. (This reflects the
assumption that there is a smallest monetary unit.) Specifically, consider two
cases: (i) $a = 6$, $c_1 = c_2 = 2$ and (ii) $a = 6$, $c_1 = 1$, $c_2 = 2$.

6.8. Bertrand with More Than Two Firms

Suppose  that  there  are  n  >  2  firms  in  the  Bertrand  model  of  Sect.  6.3.  Assume  again 
that  all  firms  have  equal  marginal  cost  c,  and  that  the  firm  with  the  lowest  price  gets 
the  whole  market.  In  case  of  a  tie,  each  firm  with  the  lowest  price  gets  an  equal  share 
of  the  market.  Set  up  the  associated  game  and  find  all  its  Nash  equilibria. 

6.9.  Variations  on  Stackelberg 

(a) Suppose, in the model in Sect. 6.4, that the firms have different marginal costs
$c_1$ and $c_2$ (cf. Problem 6.1). Compute the Stackelberg equilibrium and outcome
with firm 1 as a leader and with firm 2 as a leader.

(b) Give a logical argument why the payoff of the leader in a Stackelberg equilibrium
is always at least as high as his payoff in the Cournot equilibrium. Can you
generalize this to arbitrary games?

(c)  Consider  the  situation  in  Sect.  6.4,  but  now  assume  that  there  are  n  firms,  firm  1 
moves  first,  firm  2  second,  etc.  Assume  again  perfect  information,  and  compute 
the  subgame  perfect  equilibrium. 

6.10.  First-Price  Sealed-Bid  Auction 

Consider the game associated with the first-price sealed-bid auction in Sect. 6.5.1,
with $v_1 > \ldots > v_n > 0$ as there.

(a) Show that $(b_1, b_2, b_3, \ldots, b_n) = (v_2, v_2, v_3, \ldots, v_n)$ is a Nash equilibrium in
this game.

(b)  Show  that,  in  any  Nash  equilibrium  of  the  game,  a  player  with  the  highest 
valuation  obtains  the  object.  Exhibit  at  least  two  other  Nash  equilibria  in  this 
game,  apart  from  the  equilibrium  in  (a). 

(c)  Show  that  bidding  one’s  true  valuation  as  well  as  bidding  higher  than  one’s 
true  valuation  are  weakly  dominated  strategies.  Also  show  that  any  positive 
bid  lower  than  one’s  true  valuation  is  not  weakly  dominated.  (Note:  to  show 
that  a  strategy  is  weakly  dominated  one  needs  to  exhibit  some  other  strategy 
that  is  always — that  is,  whatever  the  other  players  do — at  least  as  good  as  the 
strategy  under  consideration  and  at  least  once — that  is,  for  at  least  one  strategy 
combination  of  the  other  players — strictly  better.) 

(d)  Show  that,  in  any  Nash  equilibrium  of  this  game,  at  least  one  player  plays  a 
weakly  dominated  strategy. 

6.11.  Second-Price  Sealed-Bid  Auction 

Consider the game associated with the second-price sealed-bid auction in Sect. 6.5.1.

(a) Formulate the payoff functions in this game.

(b) Show that $(b_1, \ldots, b_n) = (v_1, \ldots, v_n)$ is a Nash equilibrium in this game.

(c)  Show,  for  each  player,  that  bidding  one’s  true  valuation  weakly  dominates  any 
other  action.  (Show  that  this  holds  even  if  each  player  only  knows  his  own 
valuation.) 

(d) Show that $(b_1, \ldots, b_n) = (v_2, v_1, 0, \ldots, 0)$ is a Nash equilibrium in this game.
What about $(b_1, \ldots, b_n) = (v_1, 0, 0, \ldots, 0)$?

(e) Determine all Nash equilibria in the game with two players ($n = 2$). (Hint:
compute the best reply functions and make a diagram.)

6.12.  Third-Price  Sealed-Bid  Auction 

In  the  auction  of  Sect.  6.5.1,  assume  that  there  are  at  least  three  bidders  and  that  the 
highest  bidder  wins  and  pays  the  third  highest  bid. 

(a) Show that for any player $i$ bidding $v_i$ weakly dominates any lower bid but does
not weakly dominate any higher bid.

(b) Show that the strategy combination in which each player $i$ bids his true valuation
$v_i$ is in general not a Nash equilibrium.

(c)  Find  some  Nash  equilibria  of  this  game. 

6.13.  n-Player  First-Price  Sealed-Bid  Auction  with  Incomplete  Information 
Consider the setting of Sect. 6.5.2 but now assume that the number of
bidders/players is $n > 2$. Show that $(s_1^*, \ldots, s_n^*)$ with $s_i^*(v_i) = (1 - 1/n)v_i$ for every
player $i$ is a Nash equilibrium of this game. (Hence, for large $n$, each bidder almost
bids his true valuation.)


6.14.  Double  Auction 

This  problem  is  about  the  auction  in  Sect.  6.5.3. 

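(a) Fix a number $x \in [0, 1]$ and consider the following strategies $p_b(\cdot)$ and $p_s(\cdot)$:

$$p_b(v_b) = \begin{cases} x & \text{if } v_b \ge x \\ 0 & \text{if } v_b < x \end{cases} \qquad \text{and} \qquad p_s(v_s) = \begin{cases} x & \text{if } v_s \le x \\ 1 & \text{if } v_s > x. \end{cases}$$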

Show  that  these  strategies  constitute  a  Nash  equilibrium. 

(b) For the equilibrium in (a), compute the probability that trade takes place
conditional on $v_b > v_s$. For which value of $x$ is this probability maximal?

(c) Consider linear strategies of the form $p_b(v_b) = a_b + c_b v_b$ and $p_s(v_s) = a_s + c_s v_s$,
where $a_b, c_b, a_s, c_s$ are positive constants. Determine the values of the four
constants so that these strategies constitute a Nash equilibrium.

(d) Answer the same question as in (b) for the equilibrium in (c). Which of the
equilibria in (b) and (c) is the most efficient, i.e., has the highest probability of
resulting in trade conditional on $v_b > v_s$?


6.15.  Mixed  Strategies  and  Objective  Uncertainty 
Consider the bimatrix game

$$\begin{array}{c|cc} & L & R \\ \hline T & 4,1 & 1,3 \\ B & 1,2 & 3,0 \end{array}$$

(a) Compute the Nash equilibrium of this game.

(b)  Add  some  uncertainty  to  the  payoffs  of  this  game  and  find  a  pure  (Bayesian) 
Nash  equilibrium  of  the  resulting  game  of  incomplete  information,  such  that 
the  induced  a  priori  mixed  strategies  converge  to  the  Nash  equilibrium  of  the 
original  game  as  the  amount  of  uncertainty  shrinks  to  0. 

6.16.  Variations  on  Finite  Horizon  Bargaining 

(a)  Adapt  the  arguments  and  the  results  of  Sect.  6.7.1  for  the  case  where  T  is  even 
and  the  case  where  player  2  proposes  at  even  rounds. 

(b) Let $T = 3$ in Sect. 6.7.1 and suppose that the players have different discount
factors $\delta_1$ and $\delta_2$. Compute the subgame perfect equilibrium and the subgame
perfect equilibrium outcome.

(c) Consider again the model of Sect. 6.7.1, let $T = 3$, but now assume that the
utility function of player 2 is $u_2(\alpha) = \sqrt{\alpha}$ for all $\alpha \in [0, 1]$. Hence, the utility
of receiving $\alpha$ at time $t$ for player 2 is equal to $\delta^t \sqrt{\alpha}$. Compute the subgame
perfect equilibrium and the subgame perfect equilibrium outcome.

(d) Suppose, in the model of Sect. 6.7.1, that at time $T$ the ‘disagreement’ distribution
is $s = (s_1, s_2)$ with $s_1, s_2 \ge 0$ and $s_1 + s_2 \le 1$, rather than $(0, 0)$. Compute
the subgame perfect equilibrium and the subgame perfect equilibrium outcome.

(e) In (d), compute the limits of the equilibrium shares for $T$ going to infinity. Do
these limits depend on $s$?

(f) Show, in the game in Sect. 6.7.1, that subgame perfection really has a bite.
Specifically, for every $s = (s_1, s_2)$ with $s_1, s_2 \ge 0$ and $s_1 + s_2 = 1$, exhibit
a Nash equilibrium of the game in Fig. 6.5 resulting in the distribution $s$.

6.17.  Variations  on  Infinite  Horizon  Bargaining 

(a) Determine the subgame perfect equilibrium outcome and subgame perfect
equilibrium strategies in the game in Sect. 6.7.2 when the players have different
discount factors $\delta_1$ and $\delta_2$.

(b) Determine the subgame perfect equilibrium outcome and subgame perfect
equilibrium strategies in the game in Sect. 6.7.2 when player 2 proposes at even
rounds and player 1 at odd rounds.

(c) Determine the subgame perfect equilibrium outcome and subgame perfect
equilibrium strategies in the game in Sect. 6.7.2 when the ‘disagreement’
distribution is $s = (s_1, s_2)$ with $s_1, s_2 \ge 0$ and $s_1 + s_2 \le 1$, rather than $(0, 0)$, in
case the game never stops.

(d) Consider the game in Sect. 6.7.2, but now assume that the utility function of
player 2 is $u_2(\alpha) = \sqrt{\alpha}$ for all $\alpha \in [0, 1]$. Hence, the utility of receiving $\alpha$ at
time $t$ for player 2 is equal to $\delta^t \sqrt{\alpha}$. Compute the subgame perfect equilibrium
and the subgame perfect equilibrium outcome. [Hint: first determine for this
situation the values for $x^*$ and $y^*$ analogous to (6.10).]

(e) Interpret, as at the end of Sect. 6.7.2, the discount factor as the probability that
the game continues to the next round. Show that the game ends with probability
equal to 1.

6.18.  A  Principal-Agent  Game 

There  are  two  players:  a  worker  (the  agent)  and  an  employer  (the  principal).  The 
worker  has  three  choices:  either  reject  the  contract  offered  to  him  by  the  employer, 
or  accept  this  contract  and  exert  high  effort,  or  accept  the  contract  and  exert  low 
effort.  If  the  worker  rejects  the  contract  then  the  game  ends  with  a  payoff  of  zero 
to  the  employer  and  a  payoff  of  2  to  the  worker  (his  reservation  payoff).  If  the 
worker  accepts  the  contract  he  works  for  the  employer:  if  he  exerts  high  effort  the 
revenues  for  the  employer  will  be  12  with  probability  0.8  and  6  with  probability 
0.2;  if  he  exerts  low  effort  then  these  revenues  will  be  12  with  probability  0.2  and 
6  with  probability  0.8.  The  employer  can  only  observe  the  revenues  but  not  the 
effort exerted by the worker: in the contract he specifies a high wage $w_H$ in case the
revenues equal 12 and a low wage $w_L$ in case the revenues are equal to 6. These
wages  are  the  respective  choices  of  the  employer.  The  final  payoff  to  the  employer 
if  the  worker  accepts  the  contract  will  be  equal  to  revenues  minus  wage.  The  worker 
will  receive  his  wage;  his  payoff  equals  this  wage  minus  3  if  he  exerts  high  effort 
and  this  wage  minus  0  if  he  exerts  low  effort. 

(a)  Set  up  the  extensive  form  of  this  game.  Does  this  game  have  incomplete  or 
imperfect  information?  What  is  the  associated  strategic  form? 

(b)  Determine  the  subgame  perfect  equilibrium  or  equilibria  of  the  game. 

6.19.  The  Market  for  Lemons 

A  buyer  wants  to  buy  a  car  but  does  not  know  whether  the  particular  car  he  is 
interested  in  has  good  or  bad  quality  (a  lemon  is  a  car  of  bad  quality).  About  half 
of  the  market  consists  of  good  quality  cars.  The  buyer  offers  a  price  p  to  the  seller, 
who is informed about the quality of the car; the seller may then either accept or
reject this price. If he rejects, there is no sale and the payoff will be 0 to both. If he
accepts,  the  payoff  to  the  seller  will  be  the  price  minus  the  value  of  the  car,  and  to 
the  buyer  it  will  be  the  value  of  the  car  minus  the  price.  A  good  quality  car  has  a 
value  of  15,000,  a  lemon  has  a  value  of  5,000. 

(a)  Set  up  the  extensive  as  well  as  strategic  form  of  this  game. 

(b)  Compute  the  subgame  perfect  equilibrium  or  equilibria  of  this  game. 

6.20.  Corporate  Investment  and  Capital  Structure 

Consider an entrepreneur who has started a company but needs outside financing to
undertake an attractive new project. The entrepreneur has private information about
the profitability of the existing company, but the payoff of the new project cannot be
disentangled from the payoff of the existing company: all that can be observed is
the aggregate profit of the firm. Suppose the entrepreneur offers a potential investor
an equity stake in the firm in exchange for the necessary financing. Under what
circumstances will the new project be undertaken, and what will the equity stake
be? In order to model this as a game, assume that the profit of the existing company
can be either high or low: $\pi = L$ or $\pi = H$, where $H > L > 0$. Suppose that
the required investment for the new project is $I$, the payoff will be $R$, the potential
investor’s alternative rate of return is $r$, with $R > I(1 + r)$. The game is played as
follows.


1. Nature determines the profit of the existing company. The probability that $\pi = L$
is $p$.

2. The entrepreneur learns $\pi$ and then offers the potential investor an equity stake
$s$, where $0 \le s \le 1$.

3. The investor observes $s$ (but not $\pi$) and then decides either to accept or to reject
the offer.

4. If the investor rejects then the investor’s payoff is $I(1 + r) - I$ and the
entrepreneur’s payoff is $\pi$. If he accepts his payoff is $s(\pi + R) - I$ and the
entrepreneur’s is $(1 - s)(\pi + R)$.

(a)  Set  up  the  extensive  form  and  the  strategic  form  of  this  signaling  game. 

(b)  Compute  the  perfect  Bayesian  Nash  equilibrium  or  equilibria,  if  any. 


6.21.  A  Poker  Game 

Consider the following game. There are two players, I and II. Player I deals II one of
three cards (Ace, King, or Queen) at random and face down. II looks at the card.
If it is an Ace, II must say “Ace”, if a King he can say “King” or “Ace”, and if a
Queen he can say “Queen” or “Ace”. If II says “Ace” player I can either believe him
and give him $1 or ask him to show his card. If it is an Ace, I must pay II $2, but if
it is not, II pays I $2. If II says “King” neither side loses anything, but if he says
“Queen” II must pay player I $1.


(a)  Set  up  the  extensive  form  and  the  strategic  form  of  this  zerosum  game. 

(b)  Determine  its  value  and  optimal  strategies  (cf.  Chap.  2). 

6.22.  A  Hotelling  Location  Problem 

Consider $n$ players each choosing a location in the interval $[0, 1]$. One may think
of $n$ shops choosing locations in a street, $n$ firms choosing product characteristics
on a continuous scale from 0 to 1, or $n$ political parties choosing positions on the
ideological scale. We assume that customers or voters are uniformly distributed over
the interval, with a total of 1. The customers go to (voters vote for) the nearest shop
(candidate). For example, if $n = 2$ and the chosen positions are $x_1 = 0.2$ and
$x_2 = 0.6$, then 1 obtains 0.4 and 2 obtains 0.6 customers (votes). In case two or
more players occupy the same position they share the customers or voters for that
position equally.
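
The division of customers described above can be computed mechanically, which is convenient when checking candidate equilibria for larger $n$. The following is a minimal sketch; the function name `hotelling_shares` and the data layout are illustrative assumptions, not notation from the text.

```python
from collections import defaultdict


def hotelling_shares(positions):
    """Customer share of each player, given chosen positions in [0, 1].

    Customers are uniform on [0, 1] and go to the nearest position;
    players at the same position split that position's customers equally.
    """
    players_at = defaultdict(list)
    for player, x in enumerate(positions):
        players_at[x].append(player)

    points = sorted(players_at)
    shares = [0.0] * len(positions)
    for k, x in enumerate(points):
        # The catchment area runs between the midpoints to the neighbouring positions.
        left = 0.0 if k == 0 else (points[k - 1] + x) / 2
        right = 1.0 if k == len(points) - 1 else (x + points[k + 1]) / 2
        for player in players_at[x]:
            shares[player] = (right - left) / len(players_at[x])
    return shares


if __name__ == "__main__":
    print(hotelling_shares([0.2, 0.6]))   # the example from the text
```

Trying a deviation for one player amounts to changing one entry of the list and comparing that player's share before and after.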

In the first scenario, the players care only about winning or losing in terms of
the number of customers or votes. This scenario may be prominent for presidential
elections, as an example. For each player the best alternative is to be the unique
winner, the second best alternative is to be one of the winners, and the worst
alternative is not to win. For this scenario, answer questions (a) and (b).

(a) Show that there is a unique Nash equilibrium for $n = 2$.

(b) Exhibit a Nash equilibrium for $n = 3$.

In  the  second  scenario,  the  payoffs  of  the  players  are  given  by  the  total  numbers  of 
customers  (or  voters)  they  acquire.  For  this  scenario,  answer  questions  (c)  and  (d). 

(c) Show that there is a unique Nash equilibrium for $n = 2$.

(d) Is there a Nash equilibrium for $n = 3$? How about $n = 4$?

6.23.  Median  Voting 

Of the $n$ persons in a room, each person $i$ has a most favorite room temperature $t_i$,
and the further away (lower or higher) the room temperature is from $t_i$, the worse it
is for this person. Specifically, if the room temperature is $x$, then person $i$’s utility
is equal to $-|x - t_i|$. In order to find a compromise, the janitor asks each person to
propose a room temperature, and based on the proposed temperatures a compromise
is determined. The proposed temperatures are not necessarily equal to the favorite
temperatures. Only temperatures (proposed and favorite) in the interval 0-30 °C are
possible.

(a) Suppose the janitor announces that he will take the average of the proposed
temperatures as the compromise temperature. Formulate this situation as an
$n$-person game, that is, give the strategy sets of the players and the payoff
functions. Does this game have a Nash equilibrium?

(b) Suppose $n$ is odd, and suppose the janitor announces that he will take the median
of the proposed temperatures as the compromise temperature. Formulate this
situation as an $n$-person game, that is, give the strategy sets of the players and
the payoff functions. Show that, for each player, proposing his ideal temperature
weakly dominates any other strategy: thus, in particular, $(t_1, \ldots, t_n)$ is a Nash
equilibrium of this game. Does the game have any other Nash equilibria?

6.24.  The  Uniform  Rule 

An amount $M > 0$ of a good (labor, green pea soup, ...) is to be distributed
completely among $n$ persons. Each person $i$ considers an amount $t_i \ge 0$ as the ideal
amount, and the further away the allocated amount is from this ideal, the worse it
is. Specifically, if the amount allocated to person $i$ is $x_i$, then person $i$’s utility is
equal to $-|x_i - t_i|$. In order to find a compromise, each person is asked to report
an amount, and based on the reported amounts a compromise is determined. Let
the ideal amounts be given by $t_1 \le t_2 \le \ldots \le t_n$. The reported amounts are not
necessarily equal to the ideal amounts.

(a) Suppose $M$ is distributed proportionally to the reported amounts, that is, if the
reported amounts are $(r_1, \ldots, r_n)$, then person $i$ receives $x_i = \left(r_i / \sum_{j=1}^{n} r_j\right) M$.
(If all $r_j$ are zero then take $x_i = M/n$.) Formulate this situation as a game. Does
this game have a Nash equilibrium?

Consider the following division rule, called the uniform rule. Let $(r_1, \ldots, r_n)$ denote
the reported amounts. If $M \le \sum_{j=1}^{n} r_j$ then each person $i$ receives

$$x_i = \min\{r_i, \lambda\},$$

where $\lambda$ is such that $\sum_{j=1}^{n} x_j = M$. If $M \ge \sum_{j=1}^{n} r_j$ then each person $i$ receives

$$x_i = \max\{r_i, \lambda\},$$

where, again, $\lambda$ is such that $\sum_{j=1}^{n} x_j = M$.

(b) Suppose that $n = 3$ and $r_1 = 1$, $r_2 = 2$, and $r_3 = 3$. Apply the uniform rule for
$M = 4$, $M = 5$, $M = 5.5$, $M = 6$, $M = 6.5$, $M = 7$, $M = 8$, $M = 9$.

(c) Suppose, for the general case, that the uniform rule is used to distribute the
amount $M$. Formulate this situation as a game. Show that reporting one’s ideal
amount weakly dominates any other strategy: thus, in particular, $(t_1, \ldots, t_n)$ is a
Nash equilibrium of this game. Does the game have any other Nash equilibria?
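
The uniform rule is defined only implicitly, through the number that makes the allocations add up to $M$, so a small numerical routine can help when applying it to concrete reports. The sketch below finds that number by bisection; the function name, the bisection approach, and the iteration count are illustrative assumptions rather than anything prescribed by the text.

```python
def uniform_rule(reports, total, iterations=200):
    """Allocate `total` among persons with the given reported amounts.

    If total <= sum(reports), person i gets min(r_i, lam); otherwise
    max(r_i, lam), where lam is chosen so that the allocations sum to total.
    """
    clip = min if total <= sum(reports) else max

    def allocate(lam):
        return [clip(r, lam) for r in reports]

    # The sum of allocations is nondecreasing in lam, so bisection applies.
    lo, hi = (0.0, max(reports)) if clip is min else (min(reports), total)
    for _ in range(iterations):
        mid = (lo + hi) / 2
        if sum(allocate(mid)) < total:
            lo = mid
        else:
            hi = mid
    return allocate(hi)


if __name__ == "__main__":
    for amount in (4, 5, 5.5, 6, 6.5, 7, 8, 9):
        print(amount, [round(x, 6) for x in uniform_rule([1, 2, 3], amount)])
```

The loop at the bottom runs the rule on the reports of part (b); working one or two of these cases by hand first is a useful check on the routine.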

6.25.  Reporting  a  Crime 

There  are  n  individuals  who  witness  a  crime.  Everybody  would  like  the  police  to  be 
called.  If  this  happens,  each  individual  derives  satisfaction  v  >  0  from  it.  Calling 
the  police  has  a  cost  of  c,  where  0  <  c  <  v.  The  police  will  come  if  at  least  one 
person  calls.  Hence,  this  is  an  n-person  game  in  which  each  player  chooses  from 
{C,N}:  C  means  ‘call  the  police’  and  N  means  ‘do  not  call  the  police’.  The  payoff 
to person $i$ is 0 if nobody calls the police, $v - c$ if $i$ (and perhaps others) call the
police, and $v$ if the police is called but not by person $i$.

(a)  What  are  the  Nash  equilibria  of  this  game  in  pure  strategies?  In  particular,  show 
that  the  game  does  not  have  a  symmetric  Nash  equilibrium  in  pure  strategies  (a 
Nash  equilibrium  is  symmetric  if  every  player  plays  the  same  strategy). 

(b)  Compute  the  symmetric  Nash  equilibrium  or  equilibria  in  mixed  strategies. 
(Hint: suppose, in such an equilibrium, every person plays $C$ with probability
$0 < p < 1$. Use the fact that each player must be indifferent between $C$ and $N$.)

(c)  For  the  Nash  equilibrium/equilibria  in  (b),  compute  the  probability  of  the  crime 
being  reported.  What  happens  to  this  probability  if  n  becomes  large? 
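
The indifference condition from the hint in (b) can also be explored numerically. The sketch below is one possible reading, not a worked solution from the text: it assumes that a player who does not call receives $v$ exactly when at least one of the other $n - 1$ players calls, and it solves the resulting condition by bisection. The function names and parameter values are illustrative.

```python
def payoff_gap(p, n, v, c):
    """Payoff of C minus expected payoff of N when each other player calls with probability p."""
    someone_else_calls = 1.0 - (1.0 - p) ** (n - 1)
    return (v - c) - v * someone_else_calls


def symmetric_call_probability(n, v, c, iterations=100):
    """Probability p at which a player is indifferent between C and N (requires n >= 2)."""
    lo, hi = 0.0, 1.0          # the gap is positive at p = 0 and negative at p = 1
    for _ in range(iterations):
        mid = (lo + hi) / 2
        if payoff_gap(mid, n, v, c) > 0:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2


def report_probability(n, v, c):
    """Probability that at least one of the n players calls the police."""
    p = symmetric_call_probability(n, v, c)
    return 1.0 - (1.0 - p) ** n


if __name__ == "__main__":
    for n in (2, 5, 10, 100):
        print(n, symmetric_call_probability(n, 1.0, 0.25), report_probability(n, 1.0, 0.25))
```

Printing the two probabilities for increasing $n$ gives a feel for the question asked in (c).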

6.26.  Firm  Concentration 

Consider a market with ten firms. Simultaneously and independently, the firms
choose between locating downtown and locating in the suburbs. The profit of
each firm is influenced by the number of other firms that locate in the same area.
Specifically, the profit of a firm that locates downtown is given by $5n - n^2 + 50$,
where $n$ denotes the number of firms that locate downtown. Similarly, the profit of
a firm that locates in the suburbs is given by $48 - m$, where $m$ denotes the number
of firms that locate in the suburbs. In equilibrium how many firms locate in each
region and what is the profit of each?
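
Because only the number of firms in each area matters, pure-strategy equilibria of this game can be found by checking each possible split of the ten firms for profitable unilateral moves. The following sketch does this by enumeration; the function names are illustrative assumptions, and mixed strategies are not considered.

```python
def downtown_profit(n):
    """Profit of a downtown firm when n firms locate downtown."""
    return 5 * n - n ** 2 + 50


def suburb_profit(m):
    """Profit of a suburban firm when m firms locate in the suburbs."""
    return 48 - m


def pure_equilibrium_splits(firms=10):
    """All splits (n downtown, m suburbs) from which no single firm gains by moving."""
    splits = []
    for n in range(firms + 1):
        m = firms - n
        downtown_stays = n == 0 or downtown_profit(n) >= suburb_profit(m + 1)
        suburb_stays = m == 0 or suburb_profit(m) >= downtown_profit(n + 1)
        if downtown_stays and suburb_stays:
            splits.append((n, m, downtown_profit(n), suburb_profit(m)))
    return splits


if __name__ == "__main__":
    for n, m, profit_down, profit_sub in pure_equilibrium_splits():
        print(f"{n} downtown (profit {profit_down}), {m} suburbs (profit {profit_sub})")
```

A firm that moves changes both counts at once, which is why the deviation payoffs are evaluated at $m + 1$ and $n + 1$.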

6.27.  Tragedy  of  the  Commons 

There are $n$ farmers, who use a common piece of land to graze their goats. Each
farmer $i$ chooses a number of goats $g_i$; for simplicity we assume that goats are
perfectly divisible. The value to a farmer of grazing a goat when the total number
of goats is $G$ is equal to $v(G)$ per goat. We assume that there is a number $\bar{G}$ such
that $v(G) > 0$ for $G < \bar{G}$ and $v(G) = 0$ for $G \ge \bar{G}$. Moreover, $v$ is continuous, and
twice differentiable at all $G \ne \bar{G}$, with $v'(G) < 0$ and $v''(G) < 0$ for $G < \bar{G}$. The
payoff to farmer $i$ if each farmer $j$ chooses $g_j$ is equal to

$$g_i \, v(g_1 + \ldots + g_{i-1} + g_i + g_{i+1} + \ldots + g_n) - c g_i,$$

where $c > 0$ is the cost per goat.

(a)  Interpret  the  conditions  on  the  function  v. 

(b) Show that the total number of goats in a Nash equilibrium $(g_1^*, \ldots, g_n^*)$ of this
game, $G^* = g_1^* + \ldots + g_n^*$, satisfies

$$v(G^*) + (1/n) G^* v'(G^*) - c = 0.$$

(c) The socially optimal number of goats $G^{**}$ is obtained by maximizing $G v(G) - cG$
over $G \ge 0$. Show that $G^{**}$ satisfies

$$v(G^{**}) + G^{**} v'(G^{**}) - c = 0.$$

(d) Show that $G^* > G^{**}$. (Hence, in a Nash equilibrium too many goats are grazed.)

6.9  Notes 

The Cournot model in Sect. 6.2 dates back to Cournot (1838), and the Bertrand
model in Sect. 6.3 to Bertrand (1883). The occurrence of the Bertrand price
equilibrium is often referred to as the Bertrand paradox. On Cournot versus
Bertrand, see Magnan de Bornier (1992). The Stackelberg equilibrium (Sect. 6.4)
is named after von Stackelberg (1934).

Our coverage of auction theory is limited, and based on Osborne (2004) and
Gibbons (1992). For more extensive overviews and treatments see Milgrom (2004)
and Krishna (2002). The second-price auction is also called the Vickrey auction
(Vickrey, 1961). The condition that bidders bid their true valuation is an example
of the incentive compatibility requirement.

Section  6.6  is  based  on  Harsanyi  (1973),  see  also  Gibbons  (1992). 

Axiomatic bargaining theory was initiated by Nash (1950). The noncooperative,
strategic approach in Sect. 6.7 is based on Rubinstein (1982). See also Nash (1953)
for a noncooperative approach to the Nash bargaining solution.

The goods in the model of Problem 6.3 are strategic substitutes. In duopoly
models such as this the distinction between strategic substitutes and strategic
complements is important for the differences between quantity and price competition.
See, e.g., Tirole (1988).

Problem  6.19  (market  for  lemons),  exhibiting  the  adverse  selection  problem,  is 
based  on  Akerlof  (1970).  Problem  6.20  is  taken  from  Gibbons  (1992).  Problem  6.21 
(poker  game)  is  taken  from  Thomas  (1986).  For  an  axiomatization  of  the  median 
voting  method  in  Problem  6.23  see  Moulin  (1980),  and  for  an  axiomatization  of  the 
uniform  rule  in  Problem  6.24  see  Sprumont  (1991). 

Problem  6.26  is  taken  from  Watson  (2002).  For  the  tragedy  of  the  commons 
situation  in  Problem  6.27  see  Hardin  (1968)  and  Gibbons  (1992). 


Repeated Games


In the famous prisoners’ dilemma game the bad (Pareto inferior) outcome, resulting
from each player playing his dominant action, cannot be avoided in a Nash
equilibrium or subgame perfect Nash equilibrium even if the game is repeated a
finite number of times, cf. Problem 4.10. As we will see in this chapter, this bad
outcome can be avoided if the game is repeated an infinite number of times. This,
however, comes at a price, namely the existence of a multitude of outcomes
attainable in equilibrium. Such an embarrassment of riches is expressed by a
so-called folk theorem.

As  was  illustrated  in  Problem  4.11,  also  finite  repetitions  of  a  game  may 
sometimes  lead  to  outcomes  that  are  better  than  (repeated)  Nash  equilibria  of  the 
original  game. 

In  this  chapter  we  consider  two-person  infinitely  repeated  games  and  formulate 
folk  theorems  both  for  subgame  perfect  and  for  Nash  equilibrium.  The  approach 
is  somewhat  informal,  and  mainly  based  on  examples.  In  Sect.  7.1  we  consider 
subgame  perfect  equilibrium  and  in  Sect.  7.2  we  consider  Nash  equilibrium. 


7.1  Subgame  Perfect  Equilibrium 

7.1.1 The Prisoners' Dilemma

Consider the prisoners’ dilemma game (in the form of the ‘marketing game’ of
Problem 3.1(d))

$$G = \begin{array}{c|cc} & C & D \\ \hline C & 50, 50 & 30, 60 \\ D & 60, 30 & 40, 40 \end{array}$$

In $G$ each player has a strictly dominated action, namely $C$, and $(D, D)$ is the unique
Nash equilibrium of the game, also if mixed strategies are allowed.

We assume now that $G$ is played infinitely many times, at times $t = 0, 1, 2, \ldots$,
and that after each play of $G$ the players learn what has been played, i.e., they learn
which element of the set $\{(C, C), (C, D), (D, C), (D, D)\}$ has occurred. For instance,
in the marketing game, one can think of the game being played once per period
(a week, a month), each player observing in each period whether his opponent has
advertised or not. Note that a player does not learn the exact, possibly mixed, action
of his opponent, but only its realization. These realizations induce an infinite stream
of associated payoffs, and we assume that there is a common discount factor
$0 < \delta < 1$ such that the final payoff to each player is the $\delta$-discounted value of the
infinite stream. That is, player $i$ ($i = 1, 2$) obtains

$$\sum_{t=0}^{\infty} (\text{payoff from play of the stage game } G \text{ at time } t) \cdot \delta^t.$$

Here,  the  expression  stage  game  is  used  for  the  one-shot  game  G,  in  order  to 
distinguish  the  one-shot  game  from  the  repeated  game. 

As  always,  a  strategy  of  a  player  is  a  complete  plan  to  play  the  game.  This  means 
that, at each moment $t$, this plan should prescribe an action of a player (a mixed
or pure strategy in the stage game $G$) for each possible history of the game up to
time $t$, that is, an action for each sequence of length $t$ (namely, at $0, \ldots, t - 1$) of
elements from the set $\{(C, C), (C, D), (D, C), (D, D)\}$. Clearly, such a strategy can
be  quite  complicated  and  the  number  of  possible  strategies  is  enormous.  We  will  be 
able,  however,  to  restrict  attention  to  relatively  simple  strategies. 

The infinite extensive form game just defined is denoted by $G^\infty(\delta)$. A natural
solution concept for this game is subgame perfect (Nash) equilibrium. Each
subgame in $G^\infty(\delta)$ is, basically, equal to the game $G^\infty(\delta)$ itself: the difference
between two subgames is the difference between the two histories leading to those
subgames. For instance, at $t = 6$, there are $4^6$ possible histories of play and therefore
there are $4^6$ different subgames; each of these subgames, however, looks exactly like
$G^\infty(\delta)$.

We will now exhibit a few subgame perfect equilibria of $G^\infty(\delta)$. First consider
the simple strategy

$D^\infty$: play $D$ at each moment $t = 0, 1, 2, \ldots$, independent of the history of the
game, i.e., independent of what was played before $t$.

Then $D^\infty$ is a well-defined strategy. If both players play $D^\infty$ then the resulting
payoff is

$$\sum_{t=0}^{\infty} 40\,\delta^t = 40/(1 - \delta)$$

for each player. We claim that $(D^\infty, D^\infty)$ is a subgame perfect equilibrium in
$G^\infty(\delta)$. Consider any $t = 0, 1, \ldots$ and any subgame starting at time $t$. Then
$(D^\infty, D^\infty)$ induces a Nash equilibrium in this subgame: given that player 2 always
plays $D$, player 1 cannot do better than always play $D$ as well, and vice versa. Hence,
$(D^\infty, D^\infty)$ is a subgame perfect equilibrium. In this subgame perfect equilibrium,
the players just play the Nash equilibrium of the stage game at every time $t$.

We  next  exhibit  another  subgame  perfect  equilibrium.  Consider  the  following 
strategy: 

$Tr(C)$: at $t = 0$ and at every time $t$ such that in the past only $(C, C)$ has
occurred in the stage game: play $C$. Otherwise, play $D$.

Strategy $Tr(C)$ is an example of a so-called trigger strategy. In general, if the players
play trigger strategies, they follow some fixed pattern of play until a deviation
occurs: then a Nash equilibrium action of the stage game is played forever. In the
present example, $Tr(C)$, a player starts by playing $C$ and keeps on playing $C$ as long
as both players have only played $C$ in the past, i.e., as long as the history of play
is $(C, C), \ldots, (C, C)$; after any deviation from this, i.e., if the history of play is not
$(C, C), \ldots, (C, C)$, the player plays $D$ and keeps on playing $D$ forever. Again, $Tr(C)$
is a well-defined strategy, and if both players play $Tr(C)$, then each player obtains
the payoff

$$\sum_{t=0}^{\infty} 50\,\delta^t = 50/(1 - \delta) .$$


Is $(Tr(C), Tr(C))$ also a subgame perfect equilibrium? The answer is a qualified yes:
if $\delta$ is large enough, then it is. The crux of the argument is as follows. At each stage
of the game, a player has an incentive to deviate from $C$ and play his dominant
action $D$, thereby obtaining a momentary gain of 10. Deviating, however, triggers
eternal ‘punishment’ by the other player, who is going to play $D$ forever. The best
reply to this punishment is to play $D$ as well, entailing a loss of 10 at each moment
from the next moment on. The discounted value, at the moment of deviation, of this
loss is equal to $10\delta/(1 - \delta)$, and to keep a player from deviating this loss should be
at least as large as the momentary gain of 10. This is the case if and only if $\delta \ge 1/2$.

More formally, we can distinguish two kinds of subgames that are relevant for
the strategy combination $(Tr(C), Tr(C))$. One kind are those subgames where not
always $(C, C)$ has been played in the past. In such a subgame, $Tr(C)$ tells a player
to play $D$ forever, and therefore the best reply of the other player is to do so as well,
which means indeed to play according to $Tr(C)$. Thus, in this kind of subgame,
$(Tr(C), Tr(C))$ is a Nash equilibrium.




In the other kind of subgame, no deviation has occurred so far: in the past always
$(C, C)$ has been played. Consider this subgame at some time $T$ and suppose that
player 2 plays $Tr(C)$. If player 1 plays $Tr(C)$ as well, his payoff is equal to

$$\sum_{t=0}^{T-1} 50\,\delta^t + \sum_{t=T}^{\infty} 50\,\delta^t .$$

If, instead, he deviates at time $T$ to $D$, he obtains maximally

$$\sum_{t=0}^{T-1} 50\,\delta^t + 60\,\delta^T + \sum_{t=T+1}^{\infty} 40\,\delta^t .$$

The first term in this expression is the discounted payoff from $(C, C)$ at
$t = 0, \ldots, T - 1$. The second term is the discounted payoff from $(D, C)$ at time $T$. The
third term is the discounted payoff from $(D, D)$ from time $t = T + 1$ on. Note that
from $t = T + 1$ on player 2 plays $D$, according to his strategy $Tr(C)$, and the best
that the (deviating) player 1 can do is to play $D$ as well.

Hence, to avoid deviation (and make $Tr(C)$ player 1’s best reply in the subgame)
we need that the first payoff is at least as high as the second one, resulting in the
inequality

$$50\,\delta^T/(1 - \delta) \ge 60\,\delta^T + 40\,\delta^{T+1}/(1 - \delta)$$

or, equivalently, $\delta \ge 1/2$, as found before. The argument with the roles of
the players reversed is identical. We conclude that for every $\delta \ge 1/2$,
$(Tr(C), Tr(C))$ is a subgame perfect equilibrium of the game $G^\infty(\delta)$. The existence
of this equilibrium is a major reason to study infinitely repeated games. In popular
terms, it shows that cooperation is sustainable if deviations can be credibly punished,
which is the case if the future is sufficiently important (i.e., $\delta$ large enough).
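
The threshold $\delta \ge 1/2$ can be checked with exact arithmetic. The following Python sketch is an added illustration, not part of the original derivation; the function names `grim_trigger_values` and `cooperation_is_sustainable` are hypothetical. It evaluates both sides of the inequality above at $T = 0$, assuming that player 2 plays the trigger strategy.

```python
from fractions import Fraction


def grim_trigger_values(delta):
    """Player 1's discounted payoffs from time T = 0 on, given that player 2
    plays the trigger strategy (closed forms of the two geometric sums)."""
    cooperate = Fraction(50) / (1 - delta)              # (C, C) forever
    deviate = 60 + Fraction(40) * delta / (1 - delta)   # (D, C) once, then (D, D) forever
    return cooperate, deviate


def cooperation_is_sustainable(delta):
    """True if following the trigger strategy is at least as good as deviating."""
    cooperate, deviate = grim_trigger_values(delta)
    return cooperate >= deviate


for delta in (Fraction(2, 5), Fraction(1, 2), Fraction(3, 5)):
    print(delta, cooperation_is_sustainable(delta))
```

At $\delta = 1/2$ both values equal 100, so the player is exactly indifferent; below this value deviating pays.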

To exhibit yet another subgame perfect equilibrium, different from $(D^\infty, D^\infty)$
and $(Tr(C), Tr(C))$, consider the following strategies for players 1 and 2, respectively.

$Tr_1$: As long as the sequence $(C, D), (D, C), (C, D), (D, C), (C, D), (D, C), \ldots$
has occurred in the past from time 0 on, play $C$ at $t \in \{0, 2, 4, 6, \ldots\}$; play
$D$ at $t \in \{1, 3, 5, 7, \ldots\}$. Otherwise, play $D$.

$Tr_2$: As long as the sequence $(C, D), (D, C), (C, D), (D, C), (C, D), (D, C), \ldots$
has occurred in the past from time 0 on, play $D$ at $t \in \{0, 2, 4, 6, \ldots\}$; play
$C$ at $t \in \{1, 3, 5, 7, \ldots\}$. Otherwise, play $D$.

Note that these are again ‘trigger strategies’: the players ‘tacitly’ agree on a certain
sequence (pattern) of play, but revert to playing $D$ forever after a deviation. If player
1 plays $Tr_1$ and player 2 plays $Tr_2$, then the sequence $(C, D), (D, C), (C, D), (D, C),
\ldots$ results. To see why $(Tr_1, Tr_2)$ might be a subgame perfect equilibrium, note that
on average a player obtains 45 per stage, which is more than the 40 that would be
obtained from deviating from this sequence and playing $D$ forever. More precisely,
suppose player 2 plays $Tr_2$ and suppose player 1 considers a deviation from $Tr_1$. It
is optimal to deviate at an even moment, say at $t = 0$, since then player 1 receives a
payoff of 30, and can obtain 40 by deviating to $D$. Since, after this deviation, player
2 plays $D$ forever, the maximal total discounted payoff to player 1 from deviating is
obtained by also playing $D$ forever after his deviation to $D$ at time $t = 0$, and this
payoff is equal to

$$40 + 40(\delta + \delta^2 + \ldots) = 40/(1 - \delta) .$$

If player 1 does not deviate and sticks to the strategy $Tr_1$ he obtains

$$30(1 + \delta^2 + \delta^4 + \ldots) + 60(\delta + \delta^3 + \delta^5 + \ldots) = (30 + 60\delta)/(1 - \delta^2) .$$

To keep player 1 from deviating we need $40/(1 - \delta) \le (30 + 60\delta)/(1 - \delta^2)$, which
yields $\delta \ge 1/2$. The argument if the roles of the players are reversed is similar, and
we conclude that for each $\delta \ge 1/2$, $(Tr_1, Tr_2)$ is a subgame perfect equilibrium in
$G^\infty(\delta)$.
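
The comparison for the alternating pattern can be reproduced with a small helper that discounts a periodic payoff stream. This is a sketch added for illustration; the names `periodic_value` and `alternation_is_sustainable` are hypothetical, and only the deviation at $t = 0$ discussed above is checked.

```python
from fractions import Fraction


def periodic_value(cycle, delta):
    """Discounted value of the payoff stream that repeats `cycle` forever:
    (sum_k cycle[k] * delta**k) / (1 - delta**len(cycle))."""
    one_cycle = sum(payoff * delta**k for k, payoff in enumerate(cycle))
    return one_cycle / (1 - delta**len(cycle))


def alternation_is_sustainable(delta):
    """Compare player 1's payoff from the pattern 30, 60, 30, 60, ...
    with the payoff 40 forever obtained by deviating to D at t = 0."""
    stay = periodic_value([30, 60], delta)
    deviate = periodic_value([40], delta)
    return stay >= deviate


for delta in (Fraction(2, 5), Fraction(1, 2), Fraction(3, 5)):
    print(delta, alternation_is_sustainable(delta))
```

With `Fraction` inputs the arithmetic is exact, so the boundary case $\delta = 1/2$ is classified correctly.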

More generally, by playing appropriate sequences of elements from the set of
possible outcomes $\{(C, C), (C, D), (D, C), (D, D)\}$ of the stage game $G$, the players
can on average reach any convex combination of the associated payoffs in the long
run. That is, take any such combination

$$\alpha_1(50, 50) + \alpha_2(30, 60) + \alpha_3(60, 30) + \alpha_4(40, 40) , \qquad (*)$$

where $\alpha_i \in \mathbb{R}$, $\alpha_i \ge 0$ for every $i = 1, \ldots, 4$, and $\sum_{i=1}^{4} \alpha_i = 1$. By choosing
a sequence of possible outcomes such that $(C, C)$ occurs on average in a fraction
$\alpha_1$ of the stages, $(C, D)$ in a fraction $\alpha_2$, $(D, C)$ in a fraction $\alpha_3$, and $(D, D)$ in a
fraction $\alpha_4$, the payoffs $(*)$ are reached as averages in the limit, i.e., as $t \to \infty$.
These are indeed average payoffs, independent of the discount factor $\delta$. If these
limit average payoffs exceed 40 (the Nash equilibrium payoff of the stage game) for
each player, associated trigger strategies can be formulated that result in these (limit
average) payoffs and that trigger eternal play of $(D, D)$ after a deviation, similar to
the strategies $Tr(C)$, $Tr_1$ and $Tr_2$ above.

Note that for $\alpha_1 = 1$ we can take the strategy pair $(Tr(C), Tr(C))$. For $\alpha_2 = \alpha_3 =
1/2$ we can take the strategy pair $(Tr_1, Tr_2)$.

To exhibit yet another example, consider the average payoff pair (42, 48), which
is equal to

$$\tfrac{1}{5}(50, 50) + \tfrac{2}{5}(30, 60) + \tfrac{1}{5}(60, 30) + \tfrac{1}{5}(40, 40) .$$

Fig. 7.1 For every payoff pair in the shaded area there is a $\delta$ large enough such that this payoff
pair can be obtained as the limit average in a subgame perfect equilibrium of $G^\infty(\delta)$, where $G$ is
the prisoners’ dilemma game of Sect. 7.1.1


This means that the payoffs (42, 48) can be obtained on average by playing $(C, C)$
at $t = 0, 5, 10, \ldots$; $(C, D)$ at $t = 1, 2, 6, 7, 11, 12, \ldots$; $(D, C)$ at $t = 3, 8, 13, \ldots$;
and $(D, D)$ at $t = 4, 9, 14, \ldots$ Translated to trigger strategies we have

$Tr_1^*$: As long as the sequence $(C, C), (C, D), (C, D), (D, C), (D, D); (C, C), (C, D),
(C, D), (D, C), (D, D); \ldots$ has occurred in the past from time 0 on, play $C$ at
$t \in \{0, 1, 2, 5, 6, 7, 10, 11, 12, \ldots\}$; play $D$ at $t \in \{3, 4, 8, 9, 13, 14, \ldots\}$. Otherwise, play $D$.

$Tr_2^*$: As long as the sequence $(C, C), (C, D), (C, D), (D, C), (D, D); (C, C), (C, D),
(C, D), (D, C), (D, D); \ldots$ has occurred in the past from time 0 on, play $C$ at
$t \in \{0, 3, 5, 8, 10, 13, \ldots\}$; play $D$ at $t \in \{1, 2, 4, 6, 7, 9, 11, 12, 14, \ldots\}$. Otherwise, play $D$.

For $\delta$ sufficiently high, these strategies form a subgame perfect equilibrium of
$G^\infty(\delta)$. (See Problem 7.5.)

Figure  7.1  shows  all  limit  average  payoffs  that  can  be  reached  in  this  way. 
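
The average payoff pair of a periodic pattern of play is simply the average over one cycle. The sketch below, added as an illustration with hypothetical names `PAYOFFS` and `cycle_average`, recomputes the pair for the five-period cycle $(C, C), (C, D), (C, D), (D, C), (D, D)$ used in this example.

```python
from fractions import Fraction

# Stage-game payoffs of the prisoners' dilemma of Sect. 7.1.1.
PAYOFFS = {
    ("C", "C"): (50, 50),
    ("C", "D"): (30, 60),
    ("D", "C"): (60, 30),
    ("D", "D"): (40, 40),
}


def cycle_average(cycle):
    """Average payoff pair over one cycle; this equals the limit average
    when the cycle is repeated forever."""
    n = len(cycle)
    total_1 = sum(PAYOFFS[outcome][0] for outcome in cycle)
    total_2 = sum(PAYOFFS[outcome][1] for outcome in cycle)
    return Fraction(total_1, n), Fraction(total_2, n)


cycle = [("C", "C"), ("C", "D"), ("C", "D"), ("D", "C"), ("D", "D")]
print(cycle_average(cycle))
```

For this cycle the totals are 210 and 240 over five periods, which gives the averages 42 and 48.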


7.1.2  Some General Observations

For the prisoners’ dilemma game we have established that each player playing
always $D$ is a subgame perfect equilibrium of $G^\infty(\delta)$ for every $0 < \delta < 1$. The
logic is simple. If player 2 always plays the Nash equilibrium action $D$ in the stage
game, then player 1 can never do better than playing a best reply action in the stage
game, i.e., playing $D$. Here ‘never’ means: independent of the history, i.e., independent
of the play in the stage game thus far, i.e., in any subgame. The same logic holds for
any stage game, that is, with finitely or infinitely many actions, with any arbitrary
number of players, and for any Nash equilibrium of the stage game. The following
proposition merely states this more formally.

Proposition 7.1 Let $G$ be any arbitrary (not necessarily finite) $n$-person game, and
let the strategy combination $s = (s_1, \ldots, s_i, \ldots, s_n)$ be a Nash equilibrium in $G$. Let
$0 < \delta < 1$. Then each player $i$ playing $s_i$ at every moment $t$ is a subgame perfect
equilibrium in $G^\infty(\delta)$.

In  particular,  this  proposition  holds  for  any  bimatrix  game  (see  Definition  3.1)  and 
any  (not  necessarily  pure)  Nash  equilibrium  in  this  bimatrix  game.  But  it  also  holds, 
for  instance,  for  the  Cournot  or  Bertrand  games  (cf.  Chap.  6)  with  two  or  more 
players  (see  Problems  7.6  and  7.7). 

Let $G = (A, B)$ be an $m \times n$ bimatrix game with $A = (a_{ij})$ and $B = (b_{ij})$. Let
$P(G)$ be the convex hull of the set $\{(a_{ij}, b_{ij}) \in \mathbb{R}^2 \mid i = 1, \ldots, m,\ j = 1, \ldots, n\}$.
That is,

$$P(G) = \Big\{ \sum_{i=1}^{m} \sum_{j=1}^{n} \alpha_{ij} (a_{ij}, b_{ij}) \;\Big|\; \sum_{i=1}^{m} \sum_{j=1}^{n} \alpha_{ij} = 1,\ \alpha_{ij} \ge 0 \text{ for all } i, j \Big\} .$$

Equivalently, to obtain the set $P(G)$, just plot all $m \times n$ payoff pairs in $\mathbb{R}^2$, and take
the smallest convex polygon containing all these points.
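
The vertices of $P(G)$ can be computed with any planar convex hull routine. The sketch below uses Andrew's monotone chain algorithm, which is a standard choice made here for illustration and not a method prescribed by the text; the function names `cross` and `convex_hull` are hypothetical. The example input is the set of payoff pairs of the prisoners' dilemma of Sect. 7.1.1.

```python
def cross(o, a, b):
    """Cross product of the vectors o->a and o->b (positive for a left turn)."""
    return (a[0] - o[0]) * (b[1] - o[1]) - (a[1] - o[1]) * (b[0] - o[0])


def convex_hull(points):
    """Vertices of the convex hull of a set of points in the plane,
    in counterclockwise order (Andrew's monotone chain)."""
    pts = sorted(set(points))
    if len(pts) <= 2:
        return pts
    lower, upper = [], []
    for p in pts:
        while len(lower) >= 2 and cross(lower[-2], lower[-1], p) <= 0:
            lower.pop()
        lower.append(p)
    for p in reversed(pts):
        while len(upper) >= 2 and cross(upper[-2], upper[-1], p) <= 0:
            upper.pop()
        upper.append(p)
    # The last point of each chain is the first point of the other chain.
    return lower[:-1] + upper[:-1]


pd_payoffs = [(50, 50), (30, 60), (60, 30), (40, 40)]
print(convex_hull(pd_payoffs))
```

For the prisoners' dilemma all four payoff pairs are vertices. A payoff pair lying inside the polygon or on an edge would be dropped from the returned list.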

For  the  prisoners’  dilemma  game  G,  P(G)  is  the  quadrangle  with  vertices 
(40, 40),  (30,  60),  (60,  30),  and  (50,  50),  see  Fig.  7.1.  The  elements  (payoff  pairs) 
of  P(G)  can  be  obtained  as  limit  average  payoffs  in  the  infinitely  repeated  game 
G  by  an  appropriate  sequence  of  play,  as  demonstrated  before  in  Sect.  7.1.1.  The 
following  proposition  says  that  every  payoff  pair  in  P(G)  that  strictly  dominates 
the  payoffs  associated  with  a  Nash  equilibrium  of  G  can  be  obtained  as  limit 
average payoffs in a subgame perfect equilibrium of $G^\infty(\delta)$ for $\delta$ large enough.
Such  a  proposition  is  known  as  a  folk  theorem.  Its  proof  (omitted  here)  is  somewhat 
technical  but  basically  consists  of  formulating  trigger  strategies  in  a  similar  way  as 
for  the  prisoners’  dilemma  game  above.  In  these  strategies,  after  a  deviation  from 
the  pattern  leading  to  the  desired  limit  average  payoffs,  players  revert  to  the  Nash 
equilibrium  under  consideration  of  the  stage  game  forever. 

Proposition 7.2 (Folk Theorem for Subgame Perfect Equilibrium) Let $(p^*, q^*)$
be a Nash equilibrium of $G = (A, B)$, and let $x = (x_1, x_2) \in P(G)$ such that
$x_1 > p^* A q^*$ and $x_2 > p^* B q^*$. Then there is a $0 < \delta^* < 1$ and a subgame perfect
equilibrium in $G^\infty(\delta^*)$ with limit average payoffs $x$.


In this proposition it is assumed that $G^\infty(\delta)$ is well-defined, in particular that the discounted
payoff sums are finite.
Remark 7.3 For the purpose of this chapter, a limit average payoff pair is just a
payoff pair in the set $P(G)$, i.e., in the polygon with the payoff pairs in $(A, B)$ as
vertices. More formally, if $\xi_0, \xi_1, \xi_2, \ldots$ is a sequence of real numbers, then the limit
average of this sequence is the number $\lim_{T \to \infty} \frac{1}{T+1} \sum_{t=0}^{T} \xi_t$, assuming that this
limit exists. In other words, we take the average of the first $T + 1$ numbers of the
sequence and let $T$ go to infinity. □


7.1.3  Another Example

In order to illustrate Propositions 7.1 and 7.2 we consider another example. Let the
bimatrix game (stage game) $G = (A, B)$ be given by

$$
\begin{array}{c|cc}
  & L & R \\ \hline
U & 4, 4 & 0, 2 \\
D & 3, 6 & 1, 8
\end{array}
$$

This game has two pure strategy Nash equilibria $(U, L)$ and $(D, R)$ and a mixed
equilibrium $((\tfrac12, \tfrac12), (\tfrac12, \tfrac12))$, with associated payoffs respectively (4, 4), (1, 8), and
(2, 5). The set of limit average payoff pairs $P(G)$ is depicted in Fig. 7.2.
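
The mixed equilibrium and its payoffs follow from the usual indifference conditions: each player mixes so that the opponent is indifferent between his two actions. The sketch below is an added illustration; the function name `mixed_equilibrium_2x2` is hypothetical, and the code assumes that a completely mixed equilibrium exists (nonzero denominators and probabilities strictly between 0 and 1).

```python
from fractions import Fraction


def mixed_equilibrium_2x2(A, B):
    """Completely mixed equilibrium of a 2x2 bimatrix game (A, B).

    p is the probability of the first row that makes player 2 indifferent
    between the columns; q is the probability of the first column that makes
    player 1 indifferent between the rows.
    """
    p = Fraction(B[1][1] - B[1][0], B[0][0] - B[0][1] - B[1][0] + B[1][1])
    q = Fraction(A[1][1] - A[0][1], A[0][0] - A[0][1] - A[1][0] + A[1][1])

    def expected(M):
        return (p * q * M[0][0] + p * (1 - q) * M[0][1]
                + (1 - p) * q * M[1][0] + (1 - p) * (1 - q) * M[1][1])

    return (p, 1 - p), (q, 1 - q), (expected(A), expected(B))


A = [[4, 0], [3, 1]]  # payoffs to player 1 (rows U, D; columns L, R)
B = [[4, 2], [6, 8]]  # payoffs to player 2
row_mix, col_mix, payoffs = mixed_equilibrium_2x2(A, B)
print(row_mix, col_mix, payoffs)
```

For this game both players mix with probability 1/2 on each action, and the expected payoffs are 2 and 5.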

Proposition 7.1 applies to the three Nash equilibria of the stage game:

•  Player 1 always playing $U$ and player 2 always $L$ is a subgame perfect Nash
equilibrium of $G^\infty(\delta)$ for any value of $\delta$. The payoffs are $4/(1 - \delta)$ for each. The
limit average payoffs are 4 for each.

•  Player 1 always playing $D$ and player 2 always $R$ is a subgame perfect Nash
equilibrium of $G^\infty(\delta)$ for any value of $\delta$. The payoff is $1/(1 - \delta)$ for player 1 and
$8/(1 - \delta)$ for player 2. The limit average payoffs are 1 and 8, respectively.

•  Player 1 always playing $(\tfrac12, \tfrac12)$ and player 2 always $(\tfrac12, \tfrac12)$ is a subgame perfect
Nash equilibrium of $G^\infty(\delta)$ for any value of $\delta$. The payoff is $2/(1 - \delta)$ for player 1
and $5/(1 - \delta)$ for player 2. The limit average payoffs are 2 and 5, respectively.

Fig. 7.2 The crosshatched area is the set $P(G)$ of the game $G$ in Sect. 7.1.3. The shaded area
(all points strictly above (2, 5)), as well as (2, 5), (4, 4), and (1, 8), are the limit average payoff
pairs attainable in a subgame perfect equilibrium of $G^\infty(\delta)$ according to Propositions 7.1
and 7.2, for a sufficiently high value of $\delta$

Proposition 7.2 says that all payoffs in the shaded area strictly above (2, 5) in
Fig. 7.2 can be obtained as limit average payoffs in $G^\infty(\delta)$, provided $\delta$ is sufficiently
high.

As a first example we consider the payoffs (3, 6). Clearly, these payoffs can be
obtained as limit average payoffs if the players play $(D, L)$ always. In order to obtain
them in a subgame perfect equilibrium we can use the mixed Nash equilibrium of the
stage game as a punishment after a deviation. Specifically, consider the following
trigger strategy pair $(S_1^*, S_2^*)$.

$S_1^*$: Start with $D$ and keep playing $D$ as long as $(D, L)$ has been played so far.
After any deviation, play $(\tfrac12, \tfrac12)$ forever.

$S_2^*$: Start with $L$ and keep playing $L$ as long as $(D, L)$ has been played so far.
After any deviation, play $(\tfrac12, \tfrac12)$ forever.

In this case, the maximal payoff to player 1 from deviating to $U$, given that player 2
plays $S_2^*$, is equal to $4 + 2 \cdot \delta/(1 - \delta)$, since player 2 switches to $(\tfrac12, \tfrac12)$ forever: to
this, player 1’s best reply is to also play $(\tfrac12, \tfrac12)$ forever, resulting in the payoff 2 at
each stage. This maximal payoff is smaller than or equal to $3/(1 - \delta)$ if and only if
$\delta \ge 1/2$. Similarly, the maximal payoff to player 2 from deviating to $R$, given that
player 1 plays $S_1^*$, is equal to $8 + 5 \cdot \delta/(1 - \delta)$, and this is smaller than or equal to
$6/(1 - \delta)$ if and only if $\delta \ge 2/3$. We conclude that $(S_1^*, S_2^*)$ is a subgame perfect
equilibrium of $G^\infty(\delta)$ for $\delta \ge 2/3$.
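
Both thresholds are instances of one computation. Write $c$ for the per-stage payoff on the agreed path, $d$ for the one-shot payoff from the best deviation, and $n$ for the per-stage payoff in the punishment equilibrium, with $d > c > n$ (this notation is introduced here and does not appear in the original text). Then $d + n\delta/(1 - \delta) \le c/(1 - \delta)$ is equivalent to $\delta \ge (d - c)/(d - n)$. The sketch below, with the hypothetical function name `critical_discount`, applies this formula; it covers only trigger strategies whose agreed path yields the same stage payoff in every period.

```python
from fractions import Fraction


def critical_discount(path, deviation, punishment):
    """Smallest discount factor for which a one-shot deviation does not pay,
    when the agreed path gives `path` per stage, the deviation gives
    `deviation` once, and the punishment gives `punishment` per stage."""
    if not deviation > path > punishment:
        raise ValueError("expected deviation > path > punishment")
    return Fraction(deviation - path, deviation - punishment)


print(critical_discount(3, 4, 2))     # player 1 in the example above
print(critical_discount(6, 8, 5))     # player 2 in the example above
print(critical_discount(50, 60, 40))  # prisoners' dilemma of Sect. 7.1.1
```

The binding constraint is the larger of the two players' thresholds, which is why the equilibrium above requires $\delta \ge 2/3$.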

As a second example, consider the limit average payoff pair $(3, 5\tfrac13)$. From
Proposition 7.2 we can conclude that these payoffs can be obtained as limit average
payoffs in a subgame perfect equilibrium of $G^\infty(\delta)$. Note that we can write
$(3, 5\tfrac13) = \tfrac23 (4, 4) + \tfrac13 (1, 8)$, so that these limit averages can be achieved by the
sequence of play

$$(U, L), (U, L), (D, R), (U, L), (U, L), (D, R), \ldots$$

Consider the following strategies.

$S_1$: Play $U$ at times $t = 0, 1, 3, 4, 6, 7, \ldots$ and play $D$ at times $t = 2, 5, 8, \ldots$

$S_2$: Play $L$ at times $t = 0, 1, 3, 4, 6, 7, \ldots$ and play $R$ at times $t = 2, 5, 8, \ldots$



Observe that, with these strategies, the players play a Nash equilibrium of the
stage game at every time $t$, which implies that they can only lose by deviating.
Thus, we do not need trigger strategies to punish deviations. The pair $(S_1, S_2)$ is a
subgame perfect equilibrium of $G^\infty(\delta)$ for every value of $\delta \in (0, 1)$.

As a final example, consider the following strategies in $G^\infty(\delta)$ for players 1
and 2, respectively.

$\hat{S}_1$: Play $D$ at times $t = 4, 9, 14, 19, \ldots$ and play $U$ at all other times.

$\hat{S}_2$: Play $R$ at times $t = 4, 9, 14, 19, \ldots$ and play $L$ at all other times.

These strategies result in the sequence of play

$$(U, L), (U, L), (U, L), (U, L), (D, R), (U, L), (U, L), (U, L), (U, L), (D, R), \ldots$$

resulting in limit average payoffs $\tfrac45 (4, 4) + \tfrac15 (1, 8) = (3\tfrac25, 4\tfrac45)$. This pair is outside
the shaded region in Fig. 7.2, i.e., does not dominate the pair (2, 5). This means
that the mixed equilibrium $((\tfrac12, \tfrac12), (\tfrac12, \tfrac12))$ of the stage game cannot serve as a
punishment for deviations by player 2, since the payoff 5 to player 2 from this
equilibrium is larger than $4\tfrac45$. Nevertheless, by the same logic as for the pair $(S_1, S_2)$,
the strategy pair $(\hat{S}_1, \hat{S}_2)$ is a subgame perfect equilibrium of $G^\infty(\delta)$ for every value
of $\delta \in (0, 1)$: at each time $t$ the players play a Nash equilibrium of the stage game.
This example shows that Proposition 7.2 is not exhaustive: it does not necessarily
give all limit average payoff pairs attainable in a subgame perfect equilibrium.


7.2  Nash  Equilibrium 

In  this  section  we  consider  the  consequences  of  relaxing  the  subgame  perfection 
requirement  for  a  Nash  equilibrium  in  an  infinitely  repeated  game.  When  thinking 
of  trigger  strategies  as  in  Sect.  7.1,  this  means  that  deviations  can  be  punished  more 
severely,  since  the  equilibrium  does  not  have  to  induce  a  Nash  equilibrium  in  the 
‘punishment  subgame’. 

For  the  infinitely  repeated  prisoners’  dilemma  game  of  Sect.  7.1  this  has  no 
consequences.  In  this  game  each  player  can  guarantee  to  obtain  at  least  40  at  each 
stage,  so  that  more  severe  punishments  are  not  possible.  In  the  following  subsection 
we  consider  a  different  example. 

7.2.1  An  Example 

Consider the bimatrix game

$$
G = (A, B) = \begin{array}{c|cc}
  & L & R \\ \hline
U & 1, 1 & 0, 0 \\
D & 0, 0 & -1, 4
\end{array}
$$

The set $P(G)$ (see Sect. 7.1.2) is the triangle with vertices (1, 1), (0, 0), and (−1, 4).
In the game $G$ the strategy $D$ is a strictly dominated strategy for player 1. The
unique Nash equilibrium of the stage game is $(U, L)$. Player 1 always playing $U$
and player 2 always playing $L$ is a subgame perfect equilibrium in $G^\infty(\delta)$ for every
$0 < \delta < 1$, cf. Proposition 7.1. Note that Proposition 7.2 does not add anything to
this observation, since $P(G)$ does not contain any payoff pair larger than (1, 1) for
each player.

Now consider the following strategy pair $(N_1, N_2)$ in the infinitely repeated game
$G^\infty(\delta)$.

$N_1$: At $t = 0$ play $D$. After a history where $(D, R)$ was played at $t = 0, 4, 8, 12, \ldots$
and $(U, L)$ at all other times: play $D$ at $t = 0, 4, 8, 12, \ldots$ and play $U$ at all other
times. After any other history play the mixed action $(\tfrac45, \tfrac15)$, that is, play $U$ with
probability $\tfrac45$ and $D$ with probability $\tfrac15$.

$N_2$: At $t = 0$ play $R$. After a history where $(D, R)$ was played at $t = 0, 4, 8, 12, \ldots$
and $(U, L)$ at all other times: play $R$ at $t = 0, 4, 8, 12, \ldots$ and play $L$ at all other
times. After any other history play $R$.

These strategies are again trigger strategies. They induce a sequence of play in which
within each four times, $(D, R)$ is played once and $(U, L)$ is played thrice. After a
deviation player 1 plays the mixed action $(\tfrac45, \tfrac15)$ and player 2 the pure action $R$
forever. Thus, in a subgame following a deviation the players do not play a Nash
equilibrium: if player 2 plays $R$ always, then player 1’s best reply is to play $U$
always. Hence, $(N_1, N_2)$ is not a subgame perfect equilibrium.

We claim, however, that $(N_1, N_2)$ is a Nash equilibrium if $\delta$ is sufficiently large.
First observe that player 2 can never achieve a momentary gain from deviating
since, if player 1 plays $N_1$, then $N_2$ requires player 2 to play a best reply in the stage
game at every moment $t$. Moreover, after any deviation player 1 plays $(\tfrac45, \tfrac15)$ at any
moment $t$, so that both $L$ and $R$ have an expected payoff of $\tfrac45$ for player 2, which is
less than 1 and less than 4.

Suppose player 2 plays $N_2$. If player 1 wants to deviate from $N_1$, the best moment
to do so is one where he is supposed to play $D$, so at $t = 0, 4, \ldots$. Without loss of
generality suppose player 1 deviates at $t = 0$. Then $(U, R)$ is played at $t = 0$,
yielding payoff 0 to player 1. After that, player 2 plays $R$ forever, and the best reply
of player 1 to this is to play $U$ forever, again yielding 0 each time. So his total payoff
from deviating is 0. Without deviation player 1’s total discounted payoff is equal to

$$-1(\delta^0 + \delta^4 + \delta^8 + \ldots) + 1(\delta^1 + \delta^2 + \delta^3 + \delta^5 + \delta^6 + \delta^7 + \ldots) .$$

In order to keep player 1 from deviating this expression should be at least 0, i.e.

$$-\frac{1}{1 - \delta^4} + \left( \frac{1}{1 - \delta} - \frac{1}{1 - \delta^4} \right) \ge 0$$

(equivalently, $\delta + \delta^2 + \delta^3 \ge 1$),




which holds if and only if $\delta \ge \delta^*$ with $\delta^* \approx 0.54$. Hence, for these values
of $\delta$, $(N_1, N_2)$ is a Nash equilibrium in $G^\infty(\delta)$. The limit average payoffs in this
equilibrium are equal to $\tfrac34 (1, 1) + \tfrac14 (-1, 4)$, hence to $(\tfrac12, \tfrac74)$.
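
The critical value can be approximated numerically. Player 1's discounted payoff on the equilibrium path is $-(1 + \delta^4 + \ldots) + (\delta + \delta^2 + \delta^3)(1 + \delta^4 + \ldots) = (-1 + \delta + \delta^2 + \delta^3)/(1 - \delta^4)$, so its sign is that of the numerator. The following sketch, added as an illustration with hypothetical names `no_deviation_margin` and `bisect_root`, finds the root of the numerator by bisection.

```python
def no_deviation_margin(delta):
    """Numerator of player 1's discounted path payoff; the payoff itself is
    this value divided by the positive number 1 - delta**4."""
    return -1 + delta + delta**2 + delta**3


def bisect_root(f, lo, hi, tol=1e-10):
    """Root of an increasing function f on [lo, hi], with f(lo) < 0 <= f(hi)."""
    while hi - lo > tol:
        mid = (lo + hi) / 2
        if f(mid) < 0:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2


delta_star = bisect_root(no_deviation_margin, 0.0, 1.0)
print(round(delta_star, 4))
```

Bisection applies because the numerator is increasing in $\delta$ on $(0, 1)$, negative at 0 and positive at 1.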

The actions played in this equilibrium after a deviation are, in fact, the actions
that keep the opponent to his maximin payoff. To see this, first consider the action
of player 2, $R$. The payoff matrix of player 1 is the matrix $A$ with

$$
A = \begin{array}{c|cc}
  & L & R \\ \hline
U & 1 & 0 \\
D & 0 & -1
\end{array}
$$

The value of the matrix game $A$ (cf. Chap. 2) is equal to 0 (in fact, $(U, R)$ is a
saddlepoint of $A$) and, thus, player 1 can always obtain at least 0. By playing $R$,
which is player 2’s optimal strategy in $A$, player 2 can hold player 1 down to 0.
Hence, this is the most severe punishment that player 2 can inflict upon player 1
after a deviation.

Similarly, if we view the payoff matrix $B$ for player 2 as a zerosum game with
payoffs to player 2 and, following convention, convert this to a matrix game giving
the payoffs to player 1, we obtain

$$
-B = \begin{array}{c|cc}
  & L & R \\ \hline
U & -1 & 0 \\
D & 0 & -4
\end{array}
$$

In this game, $(\tfrac45, \tfrac15)$ is an (the) optimal strategy for player 1, yielding the value of
the game, which is equal to $-\tfrac45$. Hence, player 2 can guarantee to obtain a payoff
of $\tfrac45$, but player 1 can make sure that player 2 does not obtain more than this by
playing $(\tfrac45, \tfrac15)$. Again, this is the most severe punishment that player 1 can inflict
upon player 2 after a deviation.

By using these punishments in a trigger strategy, the same logic as in Sect. 7.1
tells us that any pair of payoffs in $P(G)$ that strictly dominates the pair $v =
(v(A), -v(-B)) = (0, \tfrac45)$ can be obtained as limit average payoffs in a Nash
equilibrium of the game $G^\infty(\delta)$ for $\delta$ sufficiently large. This is illustrated in Fig. 7.3.


7.2.2  A  Folk  Theorem  for  Nash  Equilibrium 

Let $G = (A, B)$ be an arbitrary $m \times n$ bimatrix game. Let $v(A)$ be the value of the
matrix game $A$ and let $v(-B)$ be the value of the matrix game $-B$. Let the set $P(G)$
be defined as in Sect. 7.1.2. The following proposition generalizes what we found
above.
Fig. 7.3 For every payoff pair in the shaded area there is a $\delta$ large enough such that
this payoff pair can be obtained as the pair of limit averages in a Nash
equilibrium of $G^\infty(\delta)$


Proposition 7.4 (Folk Theorem for Nash Equilibrium) Let $x = (x_1, x_2) \in P(G)$
such that $x_1 > v(A)$ and $x_2 > -v(-B)$. Then there is a $0 < \delta^* < 1$ and a Nash
equilibrium in $G^\infty(\delta^*)$ with limit average payoffs $x$.

The set of payoff pairs that can be reached as limit average payoff pairs in a Nash
equilibrium (Proposition 7.4) contains the set of payoff pairs that can be obtained
this way in a subgame perfect equilibrium (Proposition 7.2). This follows because
the payoffs in a Nash equilibrium of the stage game $(A, B)$ are at least as large as the
payoffs $(v(A), -v(-B))$: if not then a player could improve by playing an optimal
strategy in the associated matrix game, guaranteeing $v(A)$ (player 1) or $-v(-B)$
(player 2).


7.2.3  Another  Example 


Consider the bimatrix game

$$
G = (A, B) = \begin{array}{c|cc}
  & L & R \\ \hline
U & 5, 1 & 1, 2 \\
D & 4, 2 & 2, 4
\end{array}
$$

The matrix game $A$ has a saddlepoint at $(D, R)$. Thus, $v(A) = 2$ and $R$ is the unique
optimal strategy of player 2 in $A$. The matrix game $-B$, given by

$$
-B = \begin{array}{c|cc}
  & L & R \\ \hline
U & -1 & -2 \\
D & -2 & -4
\end{array}
$$

has a saddlepoint at $(U, R)$. Its value is $v(-B) = -2$, and $U$ is the unique optimal
strategy of player 1 in $-B$. The set of limit average payoff pairs and the point $v =
(v(A), -v(-B)) = (2, 2)$ are depicted in Fig. 7.4.
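
Saddlepoints in pure strategies can be found by direct search: an entry is a saddlepoint if it is a minimum of its row and a maximum of its column, with payoffs going to the row player. The sketch below is an added illustration with the hypothetical function name `pure_saddle_points`; it does not handle matrix games whose value is only attained in mixed strategies.

```python
def pure_saddle_points(M):
    """Return (row, column, value) for every entry of M that is both a
    minimum of its row and a maximum of its column."""
    points = []
    for i, row in enumerate(M):
        for j, value in enumerate(row):
            column = [r[j] for r in M]
            if value == min(row) and value == max(column):
                points.append((i, j, value))
    return points


# Rows U, D (indices 0, 1); columns L, R (indices 0, 1).
A = [[5, 1], [4, 2]]
minus_B = [[-1, -2], [-2, -4]]
print(pure_saddle_points(A))        # saddlepoint of A
print(pure_saddle_points(minus_B))  # saddlepoint of -B
```

The search reports row $D$, column $R$ with value 2 for $A$, and row $U$, column $R$ with value $-2$ for $-B$, in line with $v = (2, 2)$.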
Fig. 7.4 For every payoff pair in the shaded area there is a $\delta$ large enough such that
this payoff pair can be obtained as the pair of limit averages in a Nash
equilibrium of $G^\infty(\delta)$


Consider, for instance, the limit average payoffs (3, 3), and the following strategy
pair in $G^\infty(\delta)$.

$N_1^*$: Start by playing $D$ and keep playing $D$ as long as $(D, L)$ was played at even
moments and $(D, R)$ at odd moments. After any deviation from this, play $U$
forever.

$N_2^*$: Play $L$ at even moments and $R$ at odd moments as long as $(D, L)$ was played at
even moments and $(D, R)$ at odd moments. After any deviation from this, play
$R$ forever.

These strategies result in the limit average payoffs $\tfrac12 (4, 2) + \tfrac12 (2, 4) = (3, 3)$.

Suppose player 2 plays $N_2^*$. If player 1 deviates from $N_1^*$, it is optimal to do so at
even moments, say at $t = 0$. His payoff from deviating is $5 + 2\delta/(1 - \delta)$, since
after this deviation player 2 plays R forever and player 1's best reply is to play D
forever. If player 1 plays according to $N_1^*$ he obtains $4/(1 - \delta^2) + 2\delta/(1 - \delta^2)$. To
obtain an equilibrium, this should be at least as large as the maximal payoff from
deviating, hence

$$
4/(1 - \delta^2) + 2\delta/(1 - \delta^2) \ge 5 + 2\delta/(1 - \delta),
$$

which holds if and only if $\delta \ge \tfrac13\sqrt{3} \approx 0.58$.

Suppose player 1 plays $N_1^*$. If player 2 deviates from $N_2^*$, it is optimal to do so at
even moments, say at $t = 0$. His payoff from deviating is $4 + 2\delta/(1 - \delta)$, since
after this deviation player 1 plays U forever and player 2's best reply is to play R
forever. If player 2 plays according to $N_2^*$ he obtains $2/(1 - \delta^2) + 4\delta/(1 - \delta^2)$. To
obtain an equilibrium, this should be at least as large as the maximal payoff from
deviating, hence

$$
2/(1 - \delta^2) + 4\delta/(1 - \delta^2) \ge 4 + 2\delta/(1 - \delta),
$$

which holds if and only if $\delta \ge \tfrac12\sqrt{5} - \tfrac12 \approx 0.62$.

Thus, $(N_1^*, N_2^*)$ is a Nash equilibrium of $G^\infty(\delta)$ for $\delta \ge 0.62$. Note that this
equilibrium is not subgame perfect: in a subgame following a deviation the players
end up playing (U, R) forever, which is not a Nash equilibrium in such a subgame.
Also note that the only Nash equilibrium in the stage game is (D, R). Hence,
Propositions 7.1 and 7.2 only imply that the limit average payoffs (2, 4) can be
obtained in a subgame perfect Nash equilibrium of $G^\infty(\delta)$.
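The two lower bounds on the discount factor can be checked numerically. The Python sketch below is a minimal illustration, not part of the original text: the function names are hypothetical, and it takes for granted (as argued above) that the most profitable deviation is a one-shot deviation at an even moment, followed by the punishment phase.

```python
import math


def follow_payoffs(delta):
    """Discounted payoffs (player 1, player 2) on the path (D,L), (D,R), (D,L), ..."""
    p1 = (4 + 2 * delta) / (1 - delta ** 2)
    p2 = (2 + 4 * delta) / (1 - delta ** 2)
    return p1, p2


def deviation_payoffs(delta):
    """Payoffs from deviating at t = 0 and best replying to the punishment afterwards."""
    p1 = 5 + 2 * delta / (1 - delta)  # player 1: U once, then D against R forever
    p2 = 4 + 2 * delta / (1 - delta)  # player 2: R once, then R against U forever
    return p1, p2


def is_nash(delta, tol=1e-12):
    """True if neither player gains from the deviation considered in the text."""
    f1, f2 = follow_payoffs(delta)
    d1, d2 = deviation_payoffs(delta)
    return f1 >= d1 - tol and f2 >= d2 - tol


bound_player_1 = math.sqrt(3) / 3        # about 0.58
bound_player_2 = (math.sqrt(5) - 1) / 2  # about 0.62

print(is_nash(0.60), is_nash(0.62))      # False True
```

The binding constraint is the one of player 2, which is why the equilibrium requires $\delta$ of about 0.62 or more.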


7.3  Problems 


7.1. Nash and Subgame Perfect Equilibrium in a Repeated Game (1)
Consider the following bimatrix game:

$$
G = (A, B) = \begin{array}{c|cc} & L & R \\ \hline U & 2, 3 & 1, 5 \\ D & 0, 1 & 0, 1 \end{array}.
$$


(a) Determine all Nash equilibria of this game. Also determine the value $v(A)$ of
the matrix game A and the value $v(-B)$ of the matrix game $-B$. Determine the
optimal strategies of player 2 in A and of player 1 in $-B$.

(b) Consider the repeated game $G^\infty(\delta)$. Which limit average payoffs can be
obtained in a subgame perfect equilibrium of this repeated game according to
Proposition 7.1 or Proposition 7.2? Does this depend on $\delta$?

(c) Which limit average payoffs can be obtained in a Nash equilibrium in $G^\infty(\delta)$
according to Proposition 7.4?

(d) Describe a pair of Nash equilibrium strategies in $G^\infty(\delta)$ that result in the limit
average payoffs (2, 3). What is the associated minimum value of $\delta$?


7.2.  Nash  and  Subgame  Perfect  Equilibrium  in  a  Repeated  Game  (2) 
Consider  the  following  bimatrix  game: 


$$
G = (A, B) = \begin{array}{c|cc} & L & R \\ \hline U & 2, 1 & 0, 0 \\ D & 0, 0 & 1, 2 \end{array}.
$$


(a) Which payoffs can be reached as limit average payoffs in a subgame perfect
equilibrium of the infinitely repeated game $G^\infty(\delta)$ for suitable choices of $\delta$
according to Propositions 7.1 and 7.2?

(b) Which payoffs can be reached as limit average payoffs in a Nash equilibrium
of the infinitely repeated game $G^\infty(\delta)$ for suitable choices of $\delta$ according to
Proposition 7.4?

(c) Describe a subgame perfect Nash equilibrium of $G^\infty(\delta)$ resulting in the limit
average payoffs $(\tfrac32, \tfrac32)$. Also give the corresponding restriction on $\delta$.

(d) Describe a subgame perfect Nash equilibrium of $G^\infty(\delta)$ resulting in the limit
average payoffs (1, 1). Also give the corresponding restriction on $\delta$.


7.3. Nash and Subgame Perfect Equilibrium in a Repeated Game (3)
Consider the following bimatrix game:

$$
G = (A, B) = \begin{array}{c|cc} & L & R \\ \hline U & 3, 2 & 8, 0 \\ D & 4, 0 & 6, 2 \end{array}.
$$


(a) Which payoffs can be reached as limit average payoffs in a subgame perfect
Nash equilibrium of the infinitely repeated discounted game $G^\infty(\delta)$ for suitable
choices of $\delta$ according to Propositions 7.1 and 7.2?

(b) Which payoffs can be reached as limit average payoffs in a Nash equilibrium
of the infinitely repeated game $G^\infty(\delta)$ for suitable choices of $\delta$ according to
Proposition 7.4?

(c) Describe a Nash equilibrium of $G^\infty(\delta)$ resulting in the limit average payoffs
$(4\tfrac12, 2)$. Is there any value of $\delta$ for which this equilibrium is subgame perfect?
Why or why not?


7.4.  Subgame  Perfect  Equilibrium  in  a  Repeated  Game 
Consider the following bimatrix game:

$$
G = \begin{array}{c|ccc} & L & C & R \\ \hline T & 6, 4 & 0, 7 & 0, 0 \\ M & 8, 0 & 4, 6 & 0, 0 \\ B & 0, 0 & 0, 0 & 1, 1 \end{array}.
$$

(a)  What  are  the  pure  strategy  Nash  equilibria  of  this  game? 

(b) Which limit average payoffs can be obtained in a subgame perfect equilibrium
of $G^\infty(\delta)$ according to your answer to (a) and Propositions 7.1 and 7.2?

(c) Describe a subgame perfect equilibrium in the infinitely repeated game which
results in the limit average payoffs of 5 for each player. Also give the minimum
value of $\delta$ for which your strategy combination is a subgame perfect equilibrium.

7.5. The Strategies $Tr_1^*$ and $Tr_2^*$

Give the inequalities that determine the lower bound on $\delta$ for the strategy
combination $(Tr_1^*, Tr_2^*)$ to be a subgame perfect equilibrium in the infinitely repeated
prisoners’ dilemma game in Sect. 7.1.1.


7.6.  Repeated  Cournot  and  Bertrand 


(a) Reconsider the duopoly (Cournot) game of Sect. 6.2.1. Suppose that this game
is repeated infinitely many times, and that the two firms discount the streams
of payoffs by a common discount factor $\delta$. Describe a subgame perfect Nash
equilibrium of the repeated game that results in each firm receiving half of the
monopoly profits at each time. Also give the corresponding restriction on $\delta$.
What could be meant by the expression ‘tacit collusion’?

(b)  Answer  the  same  questions  as  in  (a)  for  the  Bertrand  game  of  Sect.  6.3. 

7.7.  Repeated  Duopoly 

Two firms (1 and 2) offer heterogeneous goods at prices

$$
p_1 = \max\{10 - 2q_1 + q_2, 0\}, \qquad p_2 = \max\{10 - 2q_2 + q_1, 0\},
$$

where $q_1, q_2$ are the quantities offered. All costs are assumed to be zero, and the
firms are engaged in price competition.


(a) Show that the market clearing quantity of firm 1 at prices $p_1$ and $p_2$ is given by

$$
q_1 = \max\{10 - \tfrac23 p_1 - \tfrac13 p_2, 0\}.
$$

Also derive the quantity $q_2$ of firm 2 as a function of the prices, and set up the
profit functions of the two firms.

(b)  Derive  the  reaction  functions  of  the  firms  and  use  these  to  compute  the  Nash 
equilibrium  prices.  Compute  the  associated  profits  of  both  firms. 

(c)  Compute  the  prices  at  which  joint  profit  is  maximized.  Compute  the  associated 
profits  of  both  firms. 

(d) Now suppose that this price competition game is played infinitely many times,
and that the firms’ payoffs are discounted by a common factor $\delta$. Describe a
subgame perfect equilibrium in which joint profit is maximized in each period.
Also give the associated lower bound for the discount factor $\delta$.


7.8.  On  Discounting 

In a repeated game, interpret the discount factor $0 < \delta < 1$ as the probability
that the game will continue, i.e., that the stage game will be played again. Show
that, with this interpretation, the repeated game will end with probability 1. (Cf.
Problem 6.17(e).)


7.9.  On  Limit  Average 

Can you give an example in which the limit that defines the long run average payoffs
does not exist? (Cf. Remark 7.3.)


7.4 Notes

For  finitely  repeated  games  as  in  Problems  4.10-4.12  see  also  Benoit  and  Krishna 
(1985)  and  Friedman  (1985). 

Fudenberg  and  Maskin  (1986)  show  that  Proposition  7.4 — the  folk  theorem  for 
Nash  equilibrium  in  infinitely  repeated  games — also  holds  for  subgame  perfect  Nash 
equilibrium  if  the  dimension  of  the  ‘cooperative  payoff  space’  (the  set  P(G )  in  the 
text)  is  equal  to  the  number  of  players.  This,  however,  requires  more  sophisticated 
strategies.  See  Fudenberg  and  Tirole  (1991a)  for  further  references. 

The expression ‘folk theorem’ refers to the fact that results like this had been
known among game theorists even before they were formulated and written down
explicitly. They belonged to the folklore of game theory.

For more advanced and elaborate treatments of repeated games see, e.g.,
Fudenberg and Tirole (1991a), Mailath and Samuelson (2006), or Myerson (1991).


8 An Introduction to Evolutionary Games

In an evolutionary game, players are interpreted as populations (of animals or
individuals). The probabilities in a mixed strategy of a player in a bimatrix game
are interpreted as shares of the population. Individuals within the same part of the
population play the same pure strategy. The main ‘solution’ concept is the concept
of an evolutionary stable strategy.

Evolutionary  game  theory  originated  in  biology.  The  developed  evolutionary 
biological  concepts  were  later  applied  to  boundedly  rational  human  behavior,  and  a 
connection  was  established  with  dynamic  systems  and  with  game-theoretic  concepts 
such  as  Nash  equilibrium. 

This  chapter  presents  a  short  introduction  to  evolutionary  game  theory.  For  a 
more  advanced  continuation  see  Chap.  15. 

In  Sect.  8.1  we  consider  symmetric  two-player  games  and  evolutionary  stable 
strategies.  Evolutionary  stability  is  meant  to  capture  the  idea  of  mutation  from 
the  theory  of  evolution.  We  also  establish  that  an  evolutionary  stable  strategy  is 
part  of  a  symmetric  Nash  equilibrium.  In  Sect.  8.2  the  connection  with  the  so- 
called  replicator  dynamics  is  studied.  Replicator  dynamics  intends  to  capture  the 
evolutionary  idea  of  selection  based  on  fitness.  In  Sect.  8.3  asymmetric  games  are 
considered.  Specifically,  a  connection  between  replicator  dynamics  and  strict  Nash 
equilibrium  is  discussed. 


8.1  Symmetric  Two-Player  Games  and  Evolutionary  Stable 
Strategies 

A famous example from evolutionary game theory is the Hawk-Dove game:

$$
\begin{array}{c|cc} & \text{Hawk} & \text{Dove} \\ \hline \text{Hawk} & 0, 0 & 3, 1 \\ \text{Dove} & 1, 3 & 2, 2 \end{array}
$$

This game models the following situation. Individuals of the same large population
meet randomly, in pairs, and behave either aggressively (Hawk) or passively
(Dove); the fight is about nest sites or territories, for instance. This behavior is
genetically determined, so an individual does not really choose between the two
modes of behavior. The payoffs reflect (Darwinian) fitness, e.g., the number of
offspring. In this context, players 1 and 2 are just two different members of the same
population who meet: indeed, the game is symmetric (see below for the formal
definition). A mixed strategy $p = (p_1, p_2)$ (of player 1 or player 2) is naturally
interpreted as expressing the population shares of individuals characterized by the
same type of behavior. In other words, $p_1 \times 100\%$ of the population are Hawks
and $p_2 \times 100\%$ are Doves. In view of this interpretation, in what follows we are
particularly interested in symmetric Nash equilibria, i.e., Nash equilibria in which
the players use the same strategy. The Hawk-Dove game has three Nash equilibria,
only one of which is symmetric, namely $((\tfrac12, \tfrac12), (\tfrac12, \tfrac12))$.

Remark  8.1  The  Hawk-Dove  game  can  also  be  interpreted  as  a  Game  of  Chicken. 
Two  car  drivers  approach  each  other  on  a  road,  each  one  driving  in  the  middle.  The 
driver who is the first to return to his own lane (Dove) ‘loses’ the game, the one
who  stays  in  the  middle  ‘wins’  (Hawk).  With  this  interpretation  also  the  asymmetric 
equilibria  are  of  interest.  The  asymmetric  equilibria  can  also  be  of  interest  within 
the  evolutionary  approach:  the  Hawk-Dove  game  can  be  interpreted  as  modelling 
competition  between  two  species,  represented  by  the  row  and  the  column  player. 
Within  each  species,  there  are  again  two  types  of  behavior.  See  Sect.  8.3.  □ 

The  definitions  of  a  symmetric  game  and  a  symmetric  Nash  equilibrium  are  as 
follows. 

Definition 8.2 Let $G = (A, B)$ be an $m \times n$ bimatrix game. Then G is symmetric
if $m = n$ and $b_{ij} = a_{ji}$ for all $i, j = 1, \ldots, m$. A Nash equilibrium $(p^*, q^*)$ of G is
symmetric if $p^* = q^*$. □

In other words, a bimatrix game (A, B) is symmetric if both players have the same
number of pure strategies and the payoff matrix of player 2 is the transpose of the
payoff matrix of player 1, i.e., we obtain B by interchanging the rows and columns
of A. This will also be denoted by $B = A^T$, where ‘T’ stands for ‘transpose’. A Nash
equilibrium is symmetric if both players play the same strategy.

We  state  the  following  fact  without  a  proof  (see  Chap.  15). 

Proposition  8.3  Every  symmetric  bimatrix  game  G  has  a  symmetric  Nash 
equilibrium. 

With the interpretation above (different types of behavior within one and the same
population) it is only meaningful to consider symmetric Nash equilibria. But in
fact, we will require more.

Let $G = (A, B)$ be a symmetric game. Knowing that the game is symmetric, it
is sufficient to know the payoff matrix A, since $B = A^T$. In what follows, when
we consider a symmetric game A we mean the game $G = (A, A^T)$. Let A be an
$m \times m$ matrix. Recall (Chaps. 2 and 3) that $\Delta^m$ denotes the set of mixed strategies
(for player 1 or player 2).

The  main  concept  in  evolutionary  game  theory  is  that  of  an  evolutionary  stable 
strategy.  The  original  concept  will  be  formally  introduced  in  Chap.  15.  Here,  we 
give  an  equivalent  but  easier  to  handle  definition.  With  some  abuse  of  language  we 
give  it  the  same  name. 

Definition 8.4 Let A be an $m \times m$ matrix. A strategy $x \in \Delta^m$ is an evolutionary
stable strategy (ESS) if the following two conditions hold.

(a) (x, x) is a Nash equilibrium in $(A, A^T)$.

(b) For every $y \in \Delta^m$ with $y \ne x$ we have:

$$
xAx = yAx \;\Rightarrow\; xAy > yAy. \tag{8.1}
$$

□

To  interpret  this  definition,  think  again  of  x  as  shares  of  one  and  the  same  large 
population.  The  first  condition  says  that  this  population  is  in  equilibrium:  x  is  one 
of  the  possible  distributions  of  shares  that  maximize  average  fitness  against  x.  The 
second  condition  concerns  mutations.  Suppose  there  is  another  distribution  of  shares 
y  (a  mutation)  that  fares  equally  well  against  x  as  x  itself  does:  y  is  an  alternative 
‘best  reply’  to  x.  Then  (8.1)  says  that  x  fares  better  against  y  than  y  does  against 
itself.  Hence,  y  does  not  take  over:  the  ‘mutation’  y  is  not  successful.  The  original 
definition  of  ESS  is  phrased  in  terms  of  small  mutations,  but  this  turns  out  to  be 
equivalent  to  the  definition  above  (Chap.  15). 

The evolutionary stable strategies for an $m \times m$ matrix A can be found as follows.
First, compute the symmetric Nash equilibria of the game $G = (A, B)$ with $B = A^T$.
This can be done using the methods developed in Chap. 3. Second, for each such
equilibrium (x, x), check whether (8.1) holds. If it does, then x is an evolutionary
stable strategy.

We apply this method to the Hawk-Dove game. For this game,

$$
A = \begin{array}{c|cc} & \text{Hawk} & \text{Dove} \\ \hline \text{Hawk} & 0 & 3 \\ \text{Dove} & 1 & 2 \end{array}.
$$

The unique symmetric equilibrium strategy was $x = (\tfrac12, \tfrac12)$. The condition $xAx =
yAx$ in (8.1) is satisfied for every $y = (y, 1 - y)$. This can be seen by direct
computation but it also follows from the fact that (x, x) is a Nash equilibrium and x
has all coordinates positive (how?). Hence, we have to check if

$$
xAy > yAy
$$

for all $y = (y, 1 - y) \ne x$. This inequality reduces (check!) to:

$$
(2y - 1)^2 > 0,
$$

which is true for all $y \ne \tfrac12$. Thus, $x = (\tfrac12, \tfrac12)$ is the unique ESS in A.
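The two conditions of Definition 8.4 can also be tested by brute force. The sketch below is only an illustration under stated assumptions: the function names are hypothetical, and condition (8.1) is tested on a finite grid of mutants y, which supports but does not prove evolutionary stability. Exact rational arithmetic is used so that the equality $xAx = yAx$ is tested without rounding error.

```python
from fractions import Fraction


def payoff(x, A, y):
    """Bilinear payoff xAy for mixed strategies x and y."""
    return sum(x[i] * A[i][j] * y[j] for i in range(len(x)) for j in range(len(y)))


def is_symmetric_nash(x, A):
    """Condition (a): no pure strategy does better against x than x itself."""
    m = len(x)
    pure = [tuple(Fraction(int(i == k)) for i in range(m)) for k in range(m)]
    return all(payoff(e, A, x) <= payoff(x, A, x) for e in pure)


def violates_8_1(x, A, y):
    """True if mutant y ties with x against x, but x is not strictly better against y."""
    ties = payoff(y, A, x) == payoff(x, A, x)
    return ties and not payoff(x, A, y) > payoff(y, A, y)


A = [[0, 3], [1, 2]]  # Hawk-Dove payoffs of the row player
x = (Fraction(1, 2), Fraction(1, 2))

# Mutants y = (k/100, 1 - k/100); the grid is an assumption of this sketch.
grid = [(Fraction(k, 100), 1 - Fraction(k, 100)) for k in range(101)]
counterexamples = [y for y in grid if y != x and violates_8_1(x, A, y)]

print(is_symmetric_nash(x, A), counterexamples)  # True []
```

On this grid no mutant violates (8.1), in line with the inequality $(2y - 1)^2 > 0$ derived above.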

8.2  Replicator  Dynamics  and  Evolutionary  Stability 

Central  in  the  theory  of  evolution  are  the  concepts  of  mutation  and  selection.  While 
the  idea  of  mutation  is  meant  to  be  captured  by  the  concept  of  evolutionary  stability, 
the  idea  of  selection  is  captured  by  the  so-called  replicator  dynamics.  We  illustrate 
the  concept  of  replicator  dynamics  by  considering  again  the  Hawk-Dove  game 

$$
\begin{array}{c|cc} & \text{Hawk} & \text{Dove} \\ \hline \text{Hawk} & 0, 0 & 3, 1 \\ \text{Dove} & 1, 3 & 2, 2 \end{array}
$$

Consider a mixed strategy or, in the present context, vector of population shares
$x = (x, 1 - x)$. Consider an arbitrary individual of the population. Playing ‘Hawk’
against the population x yields an expected payoff or ‘fitness’ of

$$
0 \cdot x + 3 \cdot (1 - x) = 3(1 - x)
$$

and playing ‘Dove’ yields

$$
1 \cdot x + 2 \cdot (1 - x) = 2 - x.
$$

Hence, the average fitness of the population is

$$
x \cdot 3(1 - x) + (1 - x) \cdot (2 - x) = 2 - 2x^2.
$$

We now assume that the population shares develop over time, i.e., that x is a
function of time t, and that the change in x, described by the time derivative
$\dot{x} = \dot{x}(t) = dx(t)/dt$, is proportional to the difference between Hawk’s fitness and
average fitness. In particular, if Hawk’s fitness is larger than average fitness, then the
percentage of Hawks increases, and if Hawk’s fitness is smaller than average fitness,
then the percentage of Hawks decreases. In case of equality the population is at rest.

Fig. 8.1 Replicator dynamics for the Hawk-Dove game


Formally, we assume that $\dot{x}$ is given by the following equation.

$$
\dot{x}(t) = dx(t)/dt = x(t)\,[3(1 - x(t)) - (2 - 2x(t)^2)]. \tag{8.2}
$$

Equation (8.2) is the replicator dynamics for the Hawk-Dove game. The equation
says that the population of Hawks changes continuously (described by $dx(t)/dt$),
and that this change is proportional to the difference between the fitness at time t,
which is equal to $3(1 - x(t))$, and the average fitness of the population, which is
equal to $2 - 2x(t)^2$. Simplifying (8.2) and writing x instead of $x(t)$ yields

$$
\dot{x} = dx/dt = x(x - 1)(2x - 1).
$$

This makes it possible to draw a diagram of $dx/dt$ as a function of x (a so-called
phase diagram). See Fig. 8.1. We see that this replicator dynamics has three different
roots, the so-called rest points $x = 0$, $x = \tfrac12$, and $x = 1$. For these values of x,
the derivative $dx/dt$ is equal to zero, so the population shares do not change: the
system is at rest. In case $x = 0$ all members of the species are Doves, their fitness
is equal to the average fitness, and so nothing changes. This rest point, however, is
not stable. A slight disturbance, e.g., a genetic mutation resulting in a Hawk, makes
the number of Hawks increase because $dx/dt$ becomes positive. This increase will
go on until the rest point $x = \tfrac12$ is reached. A similar story holds for the rest point
$x = 1$, where the population consists of only Hawks. Now suppose the system is at
the rest point $x = \tfrac12$. Note that, after a disturbance in either direction, the system
will move back again to the state where half the population consists of Doves. Thus,
of the three rest points, only $x = \tfrac12$ is stable. (A formal definition of stability of a
rest point is provided in Chap. 15.)
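The behavior around the three rest points can be explored numerically. The following Python sketch integrates (8.2) with a simple forward-Euler scheme; the step size, the number of steps, and all function names are assumptions made for this illustration and are not part of the original text.

```python
def hawk_fitness(x):
    """Fitness of a Hawk when the share of Hawks is x."""
    return 3 * (1 - x)


def average_fitness(x):
    """Average fitness of the population when the share of Hawks is x."""
    return 2 - 2 * x ** 2


def rhs(x):
    """Right-hand side of (8.2): x times (Hawk fitness minus average fitness)."""
    return x * (hawk_fitness(x) - average_fitness(x))


def simulate(x0, dt=0.01, steps=5000):
    """Forward-Euler integration of dx/dt = rhs(x), starting from the share x0."""
    x = x0
    for _ in range(steps):
        x += dt * rhs(x)
    return x


# The three rest points: the right-hand side vanishes there.
print(rhs(0), rhs(0.5), rhs(1))

# Small disturbances of x = 0 and x = 1 both lead towards x = 1/2.
print(simulate(0.01), simulate(0.99))
```

Starting slightly above 0 or slightly below 1, the simulated share of Hawks approaches $\tfrac12$, which is the numerical counterpart of the phase diagram argument.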

Recall from the previous section that $x = (\tfrac12, \tfrac12)$ is also the unique evolutionary
stable strategy of the Hawk-Dove game. That this is no coincidence follows from
the next proposition, which we state here without a proof (see Chap. 15 for a proof).
(The definition of replicator dynamics is analogous to the one in the Hawk-Dove
game.)


Footnote 1: In the literature, rest points are also called equilibrium points, critical points, or stationary points.




Proposition 8.5 Let $A = \begin{pmatrix} a_{11} & a_{12} \\ a_{21} & a_{22} \end{pmatrix}$ be a $2 \times 2$ matrix with $a_{11} \ne a_{21}$ and $a_{12} \ne a_{22}$. Then:

(a) A has at least one evolutionary stable strategy.

(b) $x = (x, 1 - x)$ is an evolutionary stable strategy of A if and only if x is a stable
rest point of the replicator dynamics.

Remark  8.6  For  general  m  x  m  matrices  the  set  of  completely  mixed  (i.e.,  with 
all  coordinates  positive)  rest  points  of  the  replicator  dynamics  coincides  with  the 
set  of  completely  mixed  strategies  in  symmetric  Nash  equilibria.  There  are  also 
connections  between  stability  of  rest  points  and  further  properties  of  Nash  equilibria. 
See  Chap.  15  for  details.  □ 

Example 8.7 As another example, consider the matrix

$$
A = \begin{array}{c|cc} & V & W \\ \hline V & 3 & 1 \\ W & 1 & 2 \end{array}.
$$

The bimatrix game $(A, A^T)$ has three Nash equilibria all of which are symmetric,
namely: (V, V), (W, W), and ((1/3, 2/3), (1/3, 2/3)). Against V the unique best
reply is V, so that V is an ESS: (8.1) is satisfied trivially. By a similar argument, W
is an ESS.

Against (1/3, 2/3), any $y = (y, 1 - y)$ is a best reply. For (1/3, 2/3) to be an
ESS we therefore need

$$
\begin{pmatrix} 1/3 & 2/3 \end{pmatrix} A \begin{pmatrix} y \\ 1 - y \end{pmatrix} > \begin{pmatrix} y & 1 - y \end{pmatrix} A \begin{pmatrix} y \\ 1 - y \end{pmatrix}
$$

for all $0 \le y \le 1$ with $y \ne 1/3$. The inequality simplifies (check!) to the inequality
$(3y - 1)^2 < 0$, which never holds. Hence, (1/3, 2/3) is not an ESS.

We now investigate the replicator dynamics. The expected payoff of V against
$(x, 1 - x)$ is equal to $3x + 1 \cdot (1 - x) = 2x + 1$. The expected payoff of W against
$(x, 1 - x)$ is equal to $x + 2 \cdot (1 - x) = 2 - x$. The average payoff is
$x(2x + 1) + (1 - x)(2 - x) = 3x^2 - 2x + 2$. Hence, the replicator dynamics is

$$
dx/dt = x(2x + 1 - (3x^2 - 2x + 2)) = -x(x - 1)(3x - 1).
$$
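The replicator dynamics just derived can be explored in the same numerical way as for the Hawk-Dove game. The sketch below is illustrative only: the forward-Euler scheme, the step size, and the function names are assumptions, and x denotes the share of V in the population.

```python
def rhs(x):
    """dx/dt for Example 8.7: share of V times (fitness of V minus average fitness)."""
    fitness_v = 2 * x + 1
    average = 3 * x ** 2 - 2 * x + 2
    return x * (fitness_v - average)


def simulate(x0, dt=0.01, steps=5000):
    """Forward-Euler integration of dx/dt = rhs(x), starting from the share x0."""
    x = x0
    for _ in range(steps):
        x += dt * rhs(x)
    return x


# Starting just below or just above the rest point x = 1/3.
below = simulate(0.30)
above = simulate(0.36)
print(below, above)
```

In this sketch, a start just below 1/3 drifts to 0 and a start just above 1/3 drifts to 1, so the interior rest point repels nearby population shares.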

Figure 8.2 presents the phase diagram, which shows that $x = 0$ and $x = 1$ are
stable rest points, and that the rest point $x = 1/3$ is not stable, in accordance with
Proposition 8.5. □

Fig. 8.2 Replicator dynamics for Example 8.7


8.3  Asymmetric  Games 

The evolutionary approach to game theory is not necessarily restricted to symmetric
situations, i.e., bimatrix games of the form $(A, A^T)$ in which the row and column
players play identical strategies. In biology as well as economics one can find many
examples of asymmetric situations. Think, for instance, of two different species
competing for territory in biology; and see Problem 8.6 for an example from
economics.

Consider the $2 \times 2$ bimatrix game

$$
(A, B) = \begin{array}{c|cc} & L & R \\ \hline U & 0, 0 & 2, 2 \\ D & 1, 5 & 1, 5 \end{array}.
$$


Think of two populations, the row population and the column population. In each
population there are two different types: U and D in the row population and L and
R in the column population. Individuals of one population are continuously and
randomly matched with individuals of the other population, and we are interested
again in the development of the population shares. To start with, assume the shares
of U and D types in the row population are x and $1 - x$, respectively, and the shares
of L and R types in the column population are y and $1 - y$. The expected payoff of a
U type individual is given by:

$$
0 \cdot y + 2 \cdot (1 - y) = 2 - 2y.
$$

For a D type individual it is

$$
1 \cdot y + 1 \cdot (1 - y) = 1.
$$

For an L type individual it is

$$
0 \cdot x + 5 \cdot (1 - x) = 5 - 5x.
$$

And for an R type individual:

$$
2 \cdot x + 5 \cdot (1 - x) = 5 - 3x.
$$

Fig. 8.3 Phase diagram of the asymmetric evolutionary game


The average of the row types is therefore:

$$
x[2(1 - y)] + (1 - x) \cdot 1
$$

and the replicator dynamics for the population share $x(t)$ of U individuals is given by

$$
dx/dt = x\,[2(1 - y) - x[2(1 - y)] - (1 - x)] = x(1 - x)(1 - 2y). \tag{8.3}
$$

Here, we write x and y instead of $x(t)$ and $y(t)$. Similarly one can calculate the
replicator dynamics for the column population (check this result!):

$$
dy/dt = y(1 - y)(-2x). \tag{8.4}
$$

We are interested in the rest points of the dynamic system described by Eqs. (8.3)
and (8.4), and, in particular, in the stable rest points. Figure 8.3 presents a diagram
of the possible values of x and y. In this diagram, the black lines are the values of x
and y for which the derivative in (8.3) is equal to 0, i.e., for which the row population
is at rest. The gray lines are the values of x and y for which the derivative in (8.4) is
equal to 0: there, the column population is at rest. The points of intersection are the
points where the whole system is at rest; this is the set

$$
\{(0, y) \mid 0 \le y \le 1\} \cup \{(1, 0)\} \cup \{(1, 1)\}.
$$

In Fig. 8.3, arrows indicate the direction in which x and y move. For instance, if
$1 > y > \tfrac12$ and $0 < x < 1$ we have $dx/dt < 0$ and $dy/dt < 0$, so that in that region
x as well as y decrease. A stable rest point is a rest point such that, if the system
is slightly disturbed and moves to some point close to the rest point in question,
then it should move back again to this rest point. In terms of the arrows in Fig. 8.3
this means that a stable rest point is one where all arrows in the neighborhood point
towards that point. It is obvious that in our example the point (1, 0) is the only such
point. So the situation where the row population consists only of U type individuals
($x = 1$) and the column population consists only of R type individuals ($y = 0$) is
the only stable situation with respect to the replicator dynamics.
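The rest points and the attraction towards (1, 0) can be illustrated numerically. The Python sketch below integrates (8.3) and (8.4) with a forward-Euler scheme; the step size, the starting points, and the function names are assumptions of this illustration, and a few simulated paths do not replace the stability argument based on the phase diagram.

```python
def rhs(x, y):
    """Right-hand sides of (8.3) and (8.4) for shares x (type U) and y (type L)."""
    dx = x * (1 - x) * (1 - 2 * y)
    dy = y * (1 - y) * (-2 * x)
    return dx, dy


def is_rest_point(x, y):
    """True if both population shares are at rest."""
    return rhs(x, y) == (0, 0)


def simulate(x0, y0, dt=0.01, steps=5000):
    """Forward-Euler integration of the two-population replicator dynamics."""
    x, y = x0, y0
    for _ in range(steps):
        dx, dy = rhs(x, y)
        x, y = x + dt * dx, y + dt * dy
    return x, y


# Points of the rest set, and one point that is not at rest.
print(is_rest_point(0, 0.3), is_rest_point(1, 0), is_rest_point(1, 1))
print(is_rest_point(0.5, 0.5))

# Paths that start with y < 1/2 move towards (1, 0).
print(simulate(0.5, 0.4))
print(simulate(0.9, 0.1))
```

For starting points with $y < \tfrac12$ and $0 < x < 1$, x increases and y decreases along the whole path, so the simulated shares approach (1, 0).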

Is there a relation with Nash equilibrium? One can check (!) that the set of Nash
equilibria in this example is the set:

$$
\{(U, R), (D, L)\} \cup \{(D, (q, 1 - q)) \mid \tfrac12 \le q \le 1\}.
$$

So the stable rest point (U, R) is a Nash equilibrium. Furthermore, it has a special
characteristic, namely, it is the only strict Nash equilibrium of the game. A strict
Nash equilibrium in a game is a Nash equilibrium where each player not only does
not gain but in fact strictly loses by deviating. For instance, if the row player
deviates from U in the Nash equilibrium (U, R) then he obtains strictly less than
2. All the other equilibria in this game do not have this property. For instance, if
the column player deviates from L to R in the Nash equilibrium (D, L), then he still
obtains 5.
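The distinction between Nash and strict Nash equilibrium in pure strategies is easy to check by enumeration. The sketch below is only an illustration: the function name `pure_nash` is hypothetical, and it considers pure strategy pairs only, so the mixed equilibria (D, (q, 1 - q)) are not listed.

```python
def pure_nash(A, B):
    """Map each pure Nash equilibrium (i, j) to True if it is strict, else False."""
    m, n = len(A), len(A[0])
    result = {}
    for i in range(m):
        for j in range(n):
            # Payoffs the players would get after a unilateral pure deviation.
            row_deviations = [A[k][j] for k in range(m) if k != i]
            col_deviations = [B[i][l] for l in range(n) if l != j]
            is_nash = (all(A[i][j] >= v for v in row_deviations)
                       and all(B[i][j] >= v for v in col_deviations))
            is_strict = (all(A[i][j] > v for v in row_deviations)
                         and all(B[i][j] > v for v in col_deviations))
            if is_nash:
                result[(i, j)] = is_strict
    return result


# Rows are U, D (indices 0, 1); columns are L, R (indices 0, 1).
A = [[0, 2], [1, 1]]
B = [[0, 2], [5, 5]]

print(pure_nash(A, B))  # {(0, 1): True, (1, 0): False}
```

The output says that (U, R) is a strict Nash equilibrium, whereas (D, L) is a Nash equilibrium that is not strict, in agreement with the discussion above.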

The  observation  that  the  stable  rest  point  of  the  replicator  dynamics  coincides 
with  a  strict  Nash  equilibrium  is  not  a  coincidence.  The  following  proposition  is 
stated  here  without  a  proof. 

Proposition 8.8 In a $2 \times 2$ bimatrix game a pair of strategies is a stable rest point
of  the  replicator  dynamics  if  and  only  if  it  is  a  strict  Nash  equilibrium.  For  larger 
games,  any  stable  rest  point  of  the  replicator  dynamics  is  a  strict  Nash  equilibrium, 
but  the  converse  does  not  necessarily  hold. 

Remark  8.9  A  strict  Nash  equilibrium  in  a  bimatrix  game  must  be  a  pure  Nash 
equilibrium,  for  the  following  reason.  If  a  player  plays  two  or  more  pure  strategies 
with  positive  probability  in  a  Nash  equilibrium,  then  he  must  be  indifferent  between 
these  pure  strategies  and,  thus,  can  deviate  to  any  of  them  while  keeping  the  same 
payoff.  This  holds  true  in  any  arbitrary  game,  not  only  in  bimatrix  games.  □ 


8.4  Problems 


8.1.  Symmetric  Games 

Compute  the  evolutionary  stable  strategies  for  the  following  payoff  matrices  A. 


(a)  A  = 


(Prisoners’  Dilemma) 


(b)  A  = 


(Coordination game)

8.2. More Symmetric Games

For  each  of  the  following  two  matrices,  determine  the  replicator  dynamics,  rest 
points  and  stable  rest  points,  and  evolutionary  stable  strategies.  Include  phase 
diagrams  for  the  replicator  dynamics.  For  the  evolutionary  stable  strategies,  provide 
independent  arguments  to  show  evolutionary  stability  by  using  Definition  8.4. 


8.3.  Asymmetric  Games 

For each of the following two asymmetric situations (i.e., row and column
populations are assumed to be different and we do not only consider symmetric
population shares), determine the replicator dynamics, rest points and stable rest
points, including phase diagrams. Also determine all Nash and strict Nash equilibria.


(a) $(A, A^T) = \begin{pmatrix} 0, 0 & 1, 1 \\ 1, 1 & 0, 0 \end{pmatrix}$

(b) $(A, A^T) = \begin{pmatrix} 2, 2 & 0, 1 \\ 1, 0 & 0, 0 \end{pmatrix}$


8.4.  More  Asymmetric  Games 

For each of the following two bimatrix games, determine the replicator dynamics
and all rest points and stable rest points. Also compute all Nash equilibria, and
discuss the relation using Proposition 8.8.


(a) $(A, B) = \begin{pmatrix} 3, 2 & 8, 0 \\ 4, 0 & 6, 2 \end{pmatrix}$

(b) $(A, B) = \begin{pmatrix} 4, 3 & 3, 4 \\ 5, 5 & 2, 4 \end{pmatrix}$


8.5.  Frogs  Call  For  Mates 

Consider the following game played by male frogs who Call or Don’t Call their
mates.

$$
\begin{array}{c|cc} & \text{Call} & \text{Don't Call} \\ \hline \text{Call} & P - z,\ P - z & m - z,\ 1 - m \\ \text{Don't Call} & 1 - m,\ m - z & 0,\ 0 \end{array}
$$

The  payoffs  are  in  units  of  ‘fitness’ ,  measured  by  the  frog’s  offspring.  Here  z  denotes 
the  cost  of  Calling  (danger  of  becoming  prey,  danger  of  running  out  of  energy);  and 
m  is  the  probability  that  the  male  who  calls  in  a  pair  of  males,  the  other  of  whom 
is  not  calling,  gets  a  mate.  Typically,  m  .  Next,  if  no  male  calls  then  no  female 
is attracted, and if both call, returns diminish and they each attract P females with
0 < P < 1.

(a)  Show  that  there  are  several  possible  evolutionary  stable  strategies  for  this  game, 
depending  on  the  parameters  ( m ,  z,  P). 

(b)  Set m = 0.6 and P = 0.8. Find values for z for each of the following situations:
(i)  Don’t  Call  is  an  evolutionary  stable  strategy  (ESS);  (ii)  Call  is  an  ESS;  (iii)  A 
mixture  of  Call  and  Don’t  Call  is  an  ESS. 

(c)  Suppose there are two kinds of frogs in Frogs Call For Mates. Large frogs have a
larger cost of calling ($z_1$) than do small frogs ($z_2$). Determine the corresponding
asymmetric  bimatrix  game.  Determine  the  possible  stable  rest  points  of  the 
replicator  dynamics. 

8.6.  Video  Market  Game 

Two boundedly rational video companies are playing the following asymmetric game:

                  Open system    Lockout system
Open system       6, 4           5, 5
Lockout system    9, 1           10, 0

Company  I  (the  row  company)  has  to  decide  whether  to  have  an  open  system  or  a 
lockout  system.  Company  II  (the  column  company)  has  to  decide  whether  to  create 
its  own  system  or  copy  that  of  company  I.  What  is  a  rest  point  of  the  replicator 
dynamics  for  this  system? 


8.5  Notes 

Evolutionary  game  theory  originated  in  the  work  of  the  biologists  Maynard  Smith 
and  Price  (1973).  Taylor  and  Jonker  (1978)  and  Selten  (1983),  among  others, 
played  an  important  role  in  applying  the  developed  evolutionary  biological  concepts 
to  boundedly  rational  human  behavior,  and  in  establishing  the  connection  with 
dynamic  systems  and  with  game-theoretic  concepts  such  as  Nash  equilibrium. 
A  comprehensive  treatment  is  Weibull  (1995). 

The  original  definition  of  evolutionary  stable  strategy  (Chap.  15)  is  due  to 
Maynard  Smith  and  Price  (1973).  Taylor  and  Jonker  (1978)  introduced  the  replicator 
dynamics. 

For  more  economic  applications  of  asymmetric  evolutionary  games  see  for 
instance  Gardner  (1995).  In  the  literature  the  concept  of  evolutionary  stable  strategy 
is  extended  to  asymmetric  games.  See  Selten  (1980)  or  Hofbauer  and  Sigmund 
(1988)  for  details,  also  for  the  relation  between  stable  rest  points  and  strict  Nash 
equilibrium. 

Problems  8.5  and  8.6  are  taken  from  Gardner  (1995). 


9  Cooperative Games with Transferable Utility


The  implicit  assumption  in  a  cooperative  game  is  that  players  can  form  coalitions 
and  make  binding  agreements  on  how  to  distribute  the  proceeds  of  these  coalitions. 
A  cooperative  game  is  more  abstract  than  a  noncooperative  game  in  the  sense  that 
strategies  are  not  explicitly  modelled:  rather,  the  game  describes  what  each  possible 
coalition  can  earn  by  cooperation.  In  a  cooperative  game  with  transferable  utility 
it  is  assumed  that  the  earnings  of  a  coalition  can  be  expressed  by  one  number. 
One  may  think  of  this  number  as  an  amount  of  money,  which  can  be  distributed 
among  the  players  in  any  conceivable  way — including  negative  payments — if  the 
coalition  is  actually  formed.  More  generally,  it  is  an  amount  of  utility  and  the 
implicit  assumption  is  that  it  makes  sense  to  transfer  this  utility  among  the  players — 
for  instance,  due  to  the  presence  of  a  medium  like  money,  assuming  that  individual 
utilities  can  be  expressed  in  monetary  terms. 

This  chapter  presents  a  first  acquaintance  with  the  theory  of  cooperative  games 
with transferable utility. A few important solution concepts (the core, the Shapley
value, and the nucleolus) are briefly discussed in Sects. 9.2–9.4. We start with
examples  and  preliminaries  in  Sect.  9.1. 


9.1  Examples  and  Preliminaries 

In  Chap.  1  we  have  seen  several  examples  of  cooperative  games  with  transferable 
utility:  the  three  cities  game,  a  glove  game,  a  permutation  game,  and  a  voting  game. 
For  the  stories  giving  rise  to  these  games  the  reader  is  referred  to  Sect.  1.3.4.  Here 
we  reconsider  the  resulting  games. 

In  the  three  cities  game,  cooperation  between  cities  leads  to  cost  savings 
expressed  in  amounts  of  money,  as  in  Table  9.1.  In  the  first  line  of  this  table  all 
possible coalitions are listed. It is important to note that the term ‘coalition’ is used for
any subset of the set of players. So a coalition is not necessarily formed. The empty
subset (empty coalition) has been added for convenience: it is assigned the number
0 by convention. The numbers in the second line of the table are called the ‘worths’
of the coalitions. For instance, coalition $S = \{1,2\}$ has worth 90. In this particular
example, 90 are the costs saved by cities 1 and 2 if they cooperate. It is assumed that
this amount can be split between the two players (cities) if the coalition is actually
formed: that is, player 1 may receive $x_1 \in \mathbb{R}$ and player 2 may receive $x_2 \in \mathbb{R}$ such
that $x_1 + x_2 = 90$ or, more generally, $x_1 + x_2 \le 90$.

Table 9.1  The three cities game

S       ∅    {1}   {2}   {3}   {1,2}   {1,3}   {2,3}   {1,2,3}
v(S)    0    0     0     0     90      100     120     220

Table 9.2  A glove game

S       ∅    {1}   {2}   {3}   {1,2}   {1,3}   {2,3}   {1,2,3}
v(S)    0    0     0     0     0       1       1       1

Table 9.3  A permutation game

S       ∅    {1}   {2}   {3}   {1,2}   {1,3}   {2,3}   {1,2,3}
v(S)    0    2     5     4     14      18      9       24

In the glove game in Sect. 1.3.4 coalitions may make pairs of gloves. The game is
described in Table 9.2. In this game the worth 1 of the ‘grand coalition’ $\{1,2,3\}$, for
instance, means that this coalition can earn 1 by producing one pair of gloves. One
can think of this number as expressing the monetary value of this pair of gloves.
Alternatively, one can think of one pair of gloves having ‘utility’ equal to 1. Again,
it is assumed that the players can split up this amount in any way they like. So a
possible distribution of the worth of the grand coalition takes the form
$(x_1, x_2, x_3) \in \mathbb{R}^3$ such that $x_1 + x_2 + x_3 \le 1$. For $i = 1, 2, 3$, the number $x_i$ may
represent the money that player $i$ receives, or (if nonnegative) the percentage of time
that player $i$ is allowed to wear the gloves.

The  permutation  game  (dentist  game)  of  Sect.  1.3.4  is  reproduced  in  Table  9.3. 
In  this  game,  one  could  think  of  the  worth  of  a  coalition  as  expressing,  for  instance, 
savings  of  opportunity  costs  by  having  dentist  appointments  on  certain  days.  What 
is  important  is  that,  again,  these  worths  can  be  distributed  in  any  way  among  the 
players  of  the  coalitions. 

For the voting game related to the UN Security Council, a table could be
constructed as well, but this table would be huge: there are $2^{15} = 32{,}768$ possible
coalitions (cf. Problem 9.1). Therefore, it is convenient to describe this game as
follows. Let the permanent members be the players $1, \ldots, 5$ and let the other
members be the players $6, \ldots, 15$. Denote by $N = \{1, 2, \ldots, 15\}$ the grand coalition
of all players and by $v(S)$ the worth of a coalition $S \subseteq N$. Then

$$v(S) = \begin{cases} 1 & \text{if } \{1, \ldots, 5\} \subseteq S \text{ and } |S| \ge 9 \\ 0 & \text{otherwise,} \end{cases}$$

where $|S|$ denotes the number of players in $S$. In this case the number 1 indicates
that the coalition is ‘winning’ and the number 0 that the coalition is ‘losing’.
In analyzing games like this the resulting numbers (e.g., nonnegative numbers
$x_1, \ldots, x_{15}$ summing to 1) are usually interpreted as power indices, expressing the
power of a player in some way or another.
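
As a small illustration, the characteristic function of this voting game can be written down directly instead of tabulating all $2^{15}$ coalitions. The sketch below is only one possible encoding: the names `PERMANENT`, `PLAYERS`, `v`, and `count_winning` are chosen here for illustration and do not come from the text.

```python
from itertools import combinations

PERMANENT = frozenset(range(1, 6))   # players 1,...,5 (permanent members)
PLAYERS = frozenset(range(1, 16))    # players 1,...,15 (grand coalition N)


def v(coalition):
    """Worth of a coalition: 1 if it is winning, 0 if it is losing."""
    s = frozenset(coalition)
    return 1 if PERMANENT <= s and len(s) >= 9 else 0


def count_winning():
    """Count the winning coalitions by brute force over all 2**15 subsets."""
    players = sorted(PLAYERS)
    return sum(
        v(subset)
        for size in range(len(players) + 1)
        for subset in combinations(players, size)
    )
```

A winning coalition needs all five permanent members plus at least four of the ten other members, so the brute-force count can be cross-checked against the corresponding sum of binomial coefficients.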

We  summarize  the  concepts  introduced  informally  in  the  preceding  examples, 
formally  in  the  following  definition. 

Definition 9.1  A cooperative game with transferable utility or TU-game is a pair
$(N, v)$, where $N = \{1, \ldots, n\}$ with $n \in \mathbb{N}$ is the set of players, and $v$ is a function
assigning to each coalition $S$, i.e., to each subset $S \subseteq N$, a real number $v(S)$, such
that $v(\emptyset) = 0$. The function $v$ is called the characteristic function and $v(S)$ is called
the worth of $S$. The coalition $N$ is called the grand coalition. A payoff distribution
or payoff vector for coalition $S$ is a vector of real numbers $(x_i)_{i \in S}$.  □

When  analyzing  a  TU-game  there  are  two  important  questions  to  answer:  which 
coalitions  are  formed;  and  how  are  the  worths  of  these  coalitions  distributed  among 
their  members?  In  this  chapter  we  assume  that  the  grand  coalition  is  formed  and 
we  concentrate  on  the  second  question.  This  is  less  restrictive  than  it  may  seem 
at  first  sight,  since  coalition  formation  depends,  naturally,  on  how  the  proceeds  of 
a  coalition  are  going  to  be  distributed  among  its  members.  Thus,  also  if  smaller 
coalitions  are  formed  the  distribution  question  has  to  be  considered  for  these 
coalitions. 


9.2  The  Core 


Consider the three cities game in Table 9.1, suppose that the grand coalition gets
together, and suppose that there is a proposal $x_1 = 40$, $x_2 = 40$, and $x_3 = 140$ for
distribution of the savings $v(N) = 220$ on the bargaining table. One can imagine,
for instance, that player 3 made such a proposal. In that case, players 1 and 2 could
protest successfully, since they can save $v(\{1,2\}) = 90 > 80 = x_1 + x_2$ without
player 3. We express this by saying that $x = (x_1, x_2, x_3)$ is not in the “core” of
this game. More generally, the core of the three cities game is the set of payoff
distributions for $N = \{1,2,3\}$ such that the sum of the payoffs is equal to $v(N) = 220$
and each nonempty coalition $S$ obtains at least its own worth. Thus, it is the set

$$C = \{(x_1, x_2, x_3) \in \mathbb{R}^3 \mid x_1, x_2, x_3 \ge 0,\ x_1 + x_2 \ge 90,\ x_1 + x_3 \ge 100,\ x_2 + x_3 \ge 120,\ x_1 + x_2 + x_3 = 220\}.$$


To obtain a better idea of what this set looks like, we can make a diagram. Although
$C$ is a subset of $\mathbb{R}^3$, the constraint $x_1 + x_2 + x_3 = 220$ implies that $C$ is contained
in a two-dimensional subset of $\mathbb{R}^3$, i.e., the plane through the points $(220, 0, 0)$,
$(0, 220, 0)$, and $(0, 0, 220)$. The triangle formed by these three points is represented
in Fig. 9.1. By the constraints $x_i \ge 0$ for every $i = 1, 2, 3$, the set $C$ must be a
subset of this triangle. The set $C$ is further restricted by the three constraints for the
two-person coalitions: it is the shaded area in Fig. 9.1.

Fig. 9.1  The set $C$ (shaded) is the core of the three cities game. Line segment a
corresponds to the constraint $x_1 + x_2 \ge 90$; it consists of the payoff vectors in the
triangle with $x_1 + x_2 = 90$ or, equivalently, $x_3 = 130$. Line segment b corresponds to the
constraint $x_1 + x_3 \ge 100$, and line segment c corresponds to the constraint $x_2 + x_3 \ge 120$

Hence,  the  core  of  the  three  cities  game  is  the  polygon  and  its  inside  with  vertices 
(100, 120, 0),  (0, 120, 100),  (0,  90, 130),  (90,  0, 130),  and  (100, 0, 120). 

We  now  give  the  formal  definition  of  the  core  and  of  some  other  related  concepts. 
We write $x(S) := \sum_{i \in S} x_i$ for a payoff distribution $x = (x_1, \ldots, x_n) \in \mathbb{R}^n$ and a
nonempty coalition $S \subseteq N = \{1, \ldots, n\}$. Hence, $x(S)$ is what the members of the
coalition $S$ obtain together if the payoff vector is $x$.

Definition 9.2  For a TU-game $(N, v)$, a payoff distribution $x = (x_1, \ldots, x_n) \in \mathbb{R}^n$ is

•  efficient if $x(N) = v(N)$,

•  individually rational if $x_i \ge v(\{i\})$ for all $i \in N$,

•  coalitionally rational if $x(S) \ge v(S)$ for all nonempty coalitions $S$.

The core of $(N, v)$ is the set

$$C(N, v) = \{x \in \mathbb{R}^n \mid x(N) = v(N) \text{ and } x(S) \ge v(S) \text{ for all } \emptyset \ne S \subseteq N\}.$$

Thus, the core of $(N, v)$ is the set of all efficient and coalitionally rational payoff
distributions.  □

The  core  of  a  game  can  be  a  large  set,  as  in  the  three  cities  game;  a  small  set,  as 
in  the  glove  game  (see  Problem  9.2);  or  it  can  be  empty  (see  again  Problem  9.2). 
In  general,  core  elements  can  be  computed  by  linear  programming.  For  games  with 
two  or  three  players  the  core  can  be  computed  graphically,  as  we  did  for  the  three 
cities  game.  Sometimes,  the  core  can  be  computed  by  using  the  special  structure  of 
the  specific  game  under  consideration. 
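
For small games, membership of a given payoff vector in the core can also be checked mechanically by testing efficiency and every coalitional constraint from Definition 9.2. The following is a minimal sketch for the three cities game; the dictionary `V`, the function name `in_core`, and the tolerance `tol` are illustrative choices made here, not notation from the text.

```python
from itertools import combinations

# Worths of the three cities game (Table 9.1), keyed by coalition.
V = {
    frozenset(): 0,
    frozenset({1}): 0, frozenset({2}): 0, frozenset({3}): 0,
    frozenset({1, 2}): 90, frozenset({1, 3}): 100, frozenset({2, 3}): 120,
    frozenset({1, 2, 3}): 220,
}


def in_core(x, v, tol=1e-9):
    """x maps each player to a payoff.

    Returns True iff x is efficient and coalitionally rational.
    """
    players = sorted(x)
    if abs(sum(x.values()) - v[frozenset(players)]) > tol:
        return False  # not efficient
    for size in range(1, len(players)):
        for coalition in combinations(players, size):
            if sum(x[i] for i in coalition) < v[frozenset(coalition)] - tol:
                return False  # this coalition gets less than its worth
    return True


proposal = {1: 40, 2: 40, 3: 140}   # the proposal discussed in the text
vertex = {1: 100, 2: 120, 3: 0}     # one of the vertices of the core
```

With these definitions, `in_core(proposal, V)` fails because coalition {1,2} receives 80 < 90, whereas `in_core(vertex, V)` succeeds. The brute-force loop visits every proper coalition, so it is only practical for games with few players.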

We conclude this section by analyzing the core of the general glove game (cf.
Problem 1.9).

Example 9.3  In the general glove game, there are $\ell > 0$ players who own a left-hand
glove and $r > 0$ players who own a right-hand glove. In total there are $n$
players, so $n = \ell + r$. The coalition $N$ of all players is the grand coalition. The
worth of a coalition $S$ is equal to the number of pairs of gloves that the members
of the coalition can make. Hence, it is equal to the minimum of two numbers: the
number of left-hand glove owners in $S$ and the number of right-hand glove owners in
$S$. If we denote the set of all left-hand glove owners by $L$ and the set of all right-hand
glove owners by $R$, then we can also write this as

$$v(S) = \min\{|S \cap L|, |S \cap R|\}.$$

Here, $S \cap L$ is the intersection of $S$ and $L$, i.e., the set of left-hand glove owners in
$S$, and as before $|\cdot|$ denotes the number of elements in a set, in this case the number
of players. What is the core of this game?

To  answer  this  question  we  cannot  just  make  a  diagram  as  before.  Even  if  we  take 
specific numbers for $\ell$ and $r$, as soon as $\ell + r$ is larger than three we cannot make
a  picture.  (This  is  not  quite  true:  we  could  still  make  a  picture  for  four  players,  but 
that  would  be  a  three-dimensional  picture,  which  is  not  easy  to  draw.)  Therefore,  we 
have  to  argue  in  a  different  way. 

Let us assume, first, that $\ell > r$. Then $v(N) = r$: we can use the above formula or
simply observe that the grand coalition can make $r$ pairs. Since each single player
can make zero pairs, we also have $v(\{i\}) = 0$ for each player $i \in N$. Hence, for a
payoff vector $x = (x_1, \ldots, x_n)$ to be in the core, we already need $x_1 + \ldots + x_n = r$
and $x_i \ge 0$ for each $i \in N$. Now take a player $j \in L$, that is, $j$ owns a left-hand glove.
Consider the coalition $S$ that consists of all right-hand glove owners and of at least
$r$ left-hand glove owners but not player $j$: this is possible since $\ell > r$. Then we still
have $v(S) = r$, since $S$ can still make $r$ pairs, but this means that the members of
$S$ together should obtain at least $r$. In turn, this implies that they obtain exactly $r$
(all players together obtain $r$ and no player obtains a negative amount),
and thus player $j$ obtains zero: $x_j = 0$. Since this argument holds for every arbitrary
left-hand glove owner, every such player obtains zero. So far we have derived: all
right-hand glove owners together obtain $r$, each one of them obtains at least zero,
and every left-hand glove owner obtains zero.

Now let $i$ be a right-hand glove owner and $j$ a left-hand glove owner. Then
$v(\{i,j\}) = 1$, hence $x_i + x_j \ge 1$. Since $x_j = 0$, as already established, we have
$x_i \ge 1$. But this holds for every right-hand glove owner. Since there are $r$ of them
and together they obtain $r$, we must have that $x_i = 1$ for every right-hand glove
owner. So we have found that $x$ is unique: it assigns 1 to right-hand glove owners
and 0 to left-hand glove owners. Thus, we have found that if $x$ is in the core, then it
can only be this specific payoff vector.

Conversely,  this  payoff  vector  is  indeed  in  the  core.  All  players  together  receive  r, 
which  is  indeed  equal  to  v(N).  An  arbitrary  coalition  S  receives  an  amount  which  is 
equal to the number of right-hand glove owners in $S$: this is indeed at least equal to
the  worth  of  S,  i.e.,  the  number  of  glove  pairs  that  S  can  make. 

Altogether, in one formula, we have for the core of the general glove game with
$\ell > r$:

$$C(v) = \{x\} \text{ with } x \in \mathbb{R}^n,\ x_i = 1 \text{ for } i \in R \text{ and } x_i = 0 \text{ for } i \in L.$$

Similarly we find for $r > \ell$:

$$C(v) = \{x\} \text{ with } x \in \mathbb{R}^n,\ x_i = 1 \text{ for } i \in L \text{ and } x_i = 0 \text{ for } i \in R.$$

The case $\ell = r$ is left as Problem 9.4.  □
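
The conclusion of Example 9.3 can be sanity-checked by brute force for specific numbers. The sketch below assumes four left-hand glove owners (labelled 1 to 4) and two right-hand glove owners (labelled 5 and 6); these labels and the function names are hypothetical. It confirms that the vector found above satisfies all core constraints, but it does not prove uniqueness.

```python
from fractions import Fraction
from itertools import combinations


def glove_worth(coalition, left, right):
    """v(S) = min(|S ∩ L|, |S ∩ R|): the number of pairs S can make."""
    s = frozenset(coalition)
    return min(len(s & left), len(s & right))


def in_glove_core(x, left, right):
    """True iff x is efficient and gives every coalition at least its worth."""
    players = sorted(left | right)
    if sum(x[i] for i in players) != glove_worth(players, left, right):
        return False
    return all(
        sum(x[i] for i in s) >= glove_worth(s, left, right)
        for size in range(1, len(players))
        for s in combinations(players, size)
    )


LEFT = frozenset({1, 2, 3, 4})    # assumed labels of left-hand glove owners
RIGHT = frozenset({5, 6})         # assumed labels of right-hand glove owners

# 0 for every left-hand glove owner, 1 for every right-hand glove owner.
candidate = {**{i: 0 for i in LEFT}, **{i: 1 for i in RIGHT}}
# An efficient alternative: everybody receives 1/3.
equal_split = {i: Fraction(1, 3) for i in LEFT | RIGHT}
```

Here `in_glove_core(candidate, LEFT, RIGHT)` holds, while the equal split is rejected because, for example, coalition {1, 5} can make one pair but receives only 2/3.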


9.3  The  Shapley  Value 

The  Shapley  value  is  a  solution  concept  for  TU-games  that  is  quite  different  from 
the  core.  Whereas  the  core  is  a  (possibly  empty)  set ,  the  Shapley  value  assigns  a 
unique  payoff  distribution  for  the  grand  coalition  to  every  TU-game.  The  Shapley 
value  is  not  so  much  based  on  strategic  considerations  but,  rather,  assigns  to  each 
player  his  “average  marginal  contribution”  in  the  game.  For  three-player  games  we 
already  explained  the  definition  of  the  Shapley  value  in  Chap.  1.  Here,  we  repeat 
and  extend  this  definition. 

Consider  again  the  three  cities  game  of  Table  9.1.  Imagine  a  setting  where 
the  players  enter  a  bargaining  room  one  by  one,  and  upon  entering  each  player 
demands  and  obtains  what  he  contributes  to  the  worth  of  the  coalition  present  in 
the  room.  Suppose  that  player  1  enters  first,  player  2  enters  next,  and  player  3 
enters last. Player 1 enters an empty room and can take his “marginal contribution”
$v(\{1\}) - v(\emptyset) = 0 - 0 = 0$. When player 2 enters, player 1 is already present,
and player 2 obtains his marginal contribution $v(\{1,2\}) - v(\{1\}) = 90 - 0 = 90$.
When, finally, player 3 enters, then the coalition $\{1,2\}$ is already present. So player
3 obtains his marginal contribution $v(\{1,2,3\}) - v(\{1,2\}) = 220 - 90 = 130$.
Hence, this procedure results in the payoff distribution $(0, 90, 130)$, which is called
a  marginal  vector.  Of  course,  this  payoff  distribution  does  not  seem  fair  since  it 
depends  on  the  order  in  which  the  players  enter  the  room,  and  this  order  is  arbitrary: 
there  are  five  other  possible  orders.  The  Shapley  value  takes  the  marginal  vectors 
of  all  six  orders  into  consideration,  and  assigns  to  a  TU-game  their  average.  See 
Table  9.4. 
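
The averaging of marginal vectors described above is easy to carry out by machine for a small game. The following minimal sketch does so for the three cities game with exact rational arithmetic; the names `V`, `marginal_vector`, and `shapley_by_orders` are illustrative choices and not notation from the text.

```python
from fractions import Fraction
from itertools import permutations

# Worths of the three cities game (Table 9.1), keyed by coalition.
V = {
    frozenset(): 0,
    frozenset({1}): 0, frozenset({2}): 0, frozenset({3}): 0,
    frozenset({1, 2}): 90, frozenset({1, 3}): 100, frozenset({2, 3}): 120,
    frozenset({1, 2, 3}): 220,
}


def marginal_vector(order, v):
    """Each player gets what he adds to the coalition already in the room."""
    payoff = {}
    present = frozenset()
    for player in order:
        payoff[player] = v[present | {player}] - v[present]
        present = present | {player}
    return payoff


def shapley_by_orders(players, v):
    """Average the marginal vectors over all n! orders of entry."""
    orders = list(permutations(players))
    totals = {i: 0 for i in players}
    for order in orders:
        for i, contribution in marginal_vector(order, v).items():
            totals[i] += contribution
    return {i: Fraction(totals[i], len(orders)) for i in players}
```

For the order 1, 2, 3 the function `marginal_vector` returns the vector (0, 90, 130) computed in the text, and `shapley_by_orders([1, 2, 3], V)` averages the six marginal vectors. Since the loop runs over all n! orders, this approach is only feasible for small n.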

Table 9.4  Computation of the Shapley value for the three cities game. The Shapley
value is obtained by dividing the totals of the marginal contributions by 6

Order of entry    1      2      3
1,2,3             0      90     130
1,3,2             0      120    100
2,1,3             90     0      130
2,3,1             100    0      120
3,1,2             100    120    0
3,2,1             100    120    0
Total             390    450    480
Shapley value     65     75     80

For an arbitrary TU-game $(N, v)$ with player set $N = \{1, \ldots, n\}$ the Shapley
value can be computed in the same way. There are $n \cdot (n-1) \cdot \ldots \cdot 2 \cdot 1 = n!$ possible
orders of the players. First compute the marginal vectors corresponding to these $n!$
different orders, and then take the average, that is, sum all marginal vectors and
divide the result by $n!$. If the number of players is large, then this is a huge task.
In the UN Security Council voting game of Sect. 9.1, for instance, this would mean
computing $15! > 13 \times 10^{11}$ marginal vectors. Fortunately, there is a more clever
way to compute the total marginal contribution of a player.

For instance, let $(N, v)$ be a TU-game with ten players. Consider player 7 and the
coalition $\{3, 5, 9\}$. The marginal contribution $v(\{3,5,9,7\}) - v(\{3,5,9\})$ accruing
to player 7 occurs in more than one marginal vector. In how many marginal vectors
does it occur? To compute this, note that first players 3, 5, and 9 must enter, and this
can happen in $3!$ different orders. Then player 7 enters. Finally, the other six players
enter, and this can happen in $6!$ different orders. Therefore, the total number of
marginal vectors in which player 7 obtains the marginal contribution
$v(\{3,5,9,7\}) - v(\{3,5,9\})$ is equal to $3! \times 6!$. By counting in this way the number
of computations is greatly reduced.

We now repeat this argument for an arbitrary TU-game $(N, v)$, an arbitrary player
$i \in N$, and an arbitrary coalition $S$ that does not contain player $i$. By the same
argument as in the preceding paragraph, the total number of marginal vectors in
which player $i$ receives the marginal contribution $v(S \cup \{i\}) - v(S)$ is equal to the
number of different orders in which the players of $S$ can enter first, $|S|!$, multiplied
by the number of different orders in which the players not in $S \cup \{i\}$ can enter after
player $i$, which is $(n - |S| - 1)!$. Hence, the total contribution obtained by player
$i$ by entering after the coalition $S$ is equal to $|S|!\,(n - |S| - 1)!\,[v(S \cup \{i\}) - v(S)]$.
The Shapley value for player $i$ is then obtained by summing over all coalitions $S$ not
containing player $i$, and dividing by $n!$. In fact, we use this alternative computation
as the definition of the Shapley value.


Definition 9.4  The Shapley value of a TU-game $(N, v)$ is denoted by $\Phi(N, v)$. Its
$i$-th coordinate, i.e., the Shapley value payoff to player $i \in N$, is given by

$$\Phi_i(N, v) = \sum_{S \subseteq N:\, i \notin S} \frac{|S|!\,(n - |S| - 1)!}{n!}\,[v(S \cup \{i\}) - v(S)].$$

□

Especially for larger TU-games it is easier to work with the formula in Definition 9.4
than  to  use  the  definition  based  on  marginal  vectors.  For  some  purposes,  however,  it 
is  easier  to  use  the  latter  definition  (Problem  9.7). 

The  Shapley  value  of  the  three  cities  game  is  an  element  of  the  core  of  that  game 
(check  this).  In  general,  however,  this  does  not  have  to  be  the  case  even  if  the  core 
is  nonempty  (Problem  9.8). 
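
The check suggested above can be done by hand, but it can also be automated. The following sketch evaluates the formula of Definition 9.4 for the three cities game and then tests the core conditions of Definition 9.2; the names `THREE_CITIES`, `worth`, `shapley_by_formula`, and `is_core_element` are assumptions made for this illustration.

```python
from fractions import Fraction
from itertools import combinations
from math import factorial

# Worths of the three cities game (Table 9.1), keyed by coalition.
THREE_CITIES = {
    frozenset(): 0,
    frozenset({1}): 0, frozenset({2}): 0, frozenset({3}): 0,
    frozenset({1, 2}): 90, frozenset({1, 3}): 100, frozenset({2, 3}): 120,
    frozenset({1, 2, 3}): 220,
}


def worth(coalition):
    """Characteristic function v of the three cities game."""
    return THREE_CITIES[frozenset(coalition)]


def shapley_by_formula(players, v):
    """Weighted sum of marginal contributions as in Definition 9.4."""
    n = len(players)
    value = {}
    for i in players:
        others = [j for j in players if j != i]
        total = Fraction(0)
        for size in range(n):  # |S| runs from 0 to n - 1
            weight = Fraction(factorial(size) * factorial(n - size - 1),
                              factorial(n))
            for s in combinations(others, size):
                total += weight * (v(s + (i,)) - v(s))
        value[i] = total
    return value


def is_core_element(x, players, v):
    """Efficiency plus x(S) >= v(S) for every proper nonempty coalition S."""
    if sum(x.values()) != v(players):
        return False
    return all(
        sum(x[i] for i in s) >= v(s)
        for size in range(1, len(players))
        for s in combinations(players, size)
    )


phi = shapley_by_formula([1, 2, 3], worth)
```

For this game `phi` agrees with the last row of Table 9.4, and `is_core_element(phi, [1, 2, 3], worth)` confirms that it satisfies all core constraints.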


Example 9.5  Consider the general glove game of Example 9.3. We assume $\ell = 4$
and $r = 2$. The worth $v(N)$ of the grand coalition is equal to 2. First note that, for
reasons of symmetry, in the Shapley value all left-hand glove owners receive the
same payoff and also all right-hand glove owners receive the same payoff. (This is
intuitive, but can also be made more precise: see Problem 9.17 or Chap. 16.) Since
the total payoff is 2, this means that it is sufficient to compute the Shapley value of
either one left-hand glove owner or one right-hand glove owner: if, say, the payoff of
a left-hand glove owner in the Shapley value is $a$, then every left-hand glove owner
receives $a$ and every right-hand glove owner receives $(2 - 4a)/2$. Let us compute $a$.

Suppose $i$ is a left-hand glove owner. Then $i$ makes a contribution of 1 to any
coalition with strictly fewer left-hand glove owners than right-hand glove owners. To
all other coalitions $i$'s contribution is zero, so we do not have to take those into
consideration. Take a coalition $S$ with $k$ right-hand glove owners and $j$ left-hand glove
owners, such that player $i$ is not in $S$ and such that $j < k$, where $k$ is equal to 1 or 2.
Then indeed $v(S \cup \{i\}) - v(S) = 1$, since $v(S) = j$ and $v(S \cup \{i\}) = j + 1$. There
are $\binom{2}{k}\binom{3}{j}$ coalitions¹ with $k$ right-hand glove owners and $j$ left-hand glove
owners other than $i$. Since $|S| = k + j$, the weight of $S$ as in the formula in
Definition 9.4 is $(k+j)!\,(5-k-j)!/6!$. Hence, summing over all possible values of $k$ and $j$ we obtain

$$\Phi_i(N, v) = a = \sum_{k=1}^{2} \sum_{j=0}^{k-1} \binom{2}{k}\binom{3}{j}\,\frac{(k+j)!\,(5-k-j)!}{6!}.$$


This is readily computed (the three terms are $48/720$, $12/720$, and $36/720$) and yields
$a = 2/15$. Hence, every left-hand glove owner receives $2/15$ in the Shapley value, which
implies that for the two right-hand glove owners $2 - 4 \cdot 2/15 = 22/15$ is left. This implies

$$\Phi_i(N, v) = \begin{cases} 2/15 & \text{if } i \text{ is a left-hand glove owner} \\ 11/15 & \text{if } i \text{ is a right-hand glove owner.} \end{cases}$$


Observe  that  the  Shapley  value  of  this  game  is  not  in  the  core  (see  Example  9.3). 


¹ In general, $\binom{p}{q} = \frac{p!}{q!\,(p-q)!}$ is the number of ways in which we can choose a set of $q$
elements from a set of $p$ elements.

The same arguments can be used to compute the Shapley value for any number
of left-hand and right-hand glove owners. If $\ell = r$, then for reasons of symmetry,
all players receive the same payoff in the Shapley value, namely 1/2.  □
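
The double sum in Example 9.5 extends directly to other numbers of glove owners. The sketch below generalizes the counting argument of the example to arbitrary numbers of left-hand and right-hand glove owners; the function names and parameters are hypothetical, and the code assumes at least one owner of each kind.

```python
from fractions import Fraction
from math import comb, factorial


def left_owner_share(num_left, num_right):
    """Shapley payoff of one left-hand glove owner i.

    Player i contributes 1 exactly to those coalitions S (not containing i)
    with k right-hand and j left-hand glove owners where j < k.
    """
    n = num_left + num_right
    total = Fraction(0)
    for k in range(1, num_right + 1):
        for j in range(k):
            # comb(num_left - 1, j) is 0 when there are too few other
            # left-hand glove owners, so no extra bound on j is needed.
            count = comb(num_right, k) * comb(num_left - 1, j)
            weight = Fraction(factorial(k + j) * factorial(n - 1 - k - j),
                              factorial(n))
            total += count * weight
    return total


def right_owner_share(num_left, num_right):
    """What remains of v(N) = min(l, r), shared equally among right-hand owners."""
    remaining = min(num_left, num_right) - num_left * left_owner_share(num_left, num_right)
    return remaining / num_right
```

For four left-hand and two right-hand glove owners this reproduces the payoffs 2/15 and 11/15 derived above, and for equal numbers of both kinds it returns 1/2 for every player.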

The  definition  of  the  Shapley  value  as  assigning  to  each  player  in  a  game  his  average 
marginal  contribution,  can  be  regarded  as  a  justification  of  this  solution  concept  by 
itself.  In  the  literature  there  are,  moreover,  a  number  of  axiomatic  characterizations 
of  the  Shapley  value.  In  an  axiomatic  characterization  one  proceeds  as  follows. 
Consider  an  arbitrary  map,  which  (like  the  Shapley  value)  assigns  to  each  game 
with  player  set  N  a  payoff  vector.  Next,  define  “reasonable”  properties  or  axioms 
for  this  map.  Such  axioms  limit  the  possible  maps  (i.e.,  solution  concepts),  and  if 
the  axioms  are  strong  enough,  they  admit  only  one  solution  concept.  This  so-called 
axiomatic approach is common in cooperative game theory. Problem 9.17 serves as a
prelude to this. For details, see Chap. 17.


9.4  The  Nucleolus 

The  last  concept  we  discuss  in  this  introduction  to  TU-games  is  the  nucleolus.  Like 
the  Shapley  value  it  assigns  a  unique  payoff  distribution  to  a  game.  An  advantage 
compared  to  the  Shapley  value  is  that  the  nucleolus  assigns  a  payoff  distribution 
in  the  core  of  a  game,  provided  the  core  is  nonempty.  The  nucleolus  is  defined 
for games which possess the following property. A TU-game $(N, v)$ is essential
if $v(N) \ge \sum_{i \in N} v(\{i\})$. For an essential game there are payoff distributions for
the grand coalition that are both efficient and individually rational. Such payoff
distributions are called imputations. The set

$$I(N, v) = \{x \in \mathbb{R}^n \mid x(N) = v(N),\ x_i \ge v(\{i\}) \text{ for all } i \in N\}$$

is called the imputation set of the TU-game $(N, v)$. Hence, a game $(N, v)$ is essential
if and only if $I(N, v) \ne \emptyset$ (check!).

Let $(N, v)$ be an essential TU-game, let $x \in I(N, v)$, and let $S$ be a nonempty
coalition unequal to $N$. The excess of $S$ at $x$, denoted by $e(S, x)$, is defined by

$$e(S, x) = v(S) - x(S).$$

The  excess  of  a  coalition  S  at  a  payoff  distribution  x  is,  thus,  the  difference  between 
what  S  can  acquire  on  its  own  and  what  it  receives  in  total  from  x.  The  excess  can 
be  seen  as  a  measure  of  the  dissatisfaction  of  the  coalition  S  with  the  imputation  x: 
the  smaller  the  total  payoff  of  S  at  x,  the  larger  e(S,  x).  In  particular,  if  this  excess  is 
positive  then  S  obtains  less  than  its  own  worth. 

In  words,  the  nucleolus  of  an  essential  TU-game  ( N ,  v )  is  defined  as  follows. 

1.  First, find all imputations for which the maximal excess among all coalitions
(not  equal  to  N  or  the  empty  set)  is  as  small  as  possible.  If  there  is  a  unique  such 
imputation,  then  that  is  the  nucleolus. 

2.  If  not,  then  determine  those  coalitions  for  which  the  maximal  excess  found  in  (1) 
cannot  be  decreased  any  further.  Then  continue  with  the  remaining  coalitions 
and  among  the  imputations  found  in  (1)  find  those  imputations  for  which  the 
maximal  excess  among  these  remaining  coalitions  is  as  small  as  possible.  If  there 
is  a  unique  such  imputation,  then  that  is  the  nucleolus. 

3.  If  not,  then  determine  those  coalitions  for  which  the  maximal  excess  found  in  (2) 
cannot  be  decreased  any  further.  Then  continue  with  the  remaining  coalitions 
and  among  the  imputations  found  in  (2)  find  those  imputations  for  which  the 
maximal  excess  among  these  remaining  coalitions  is  as  small  as  possible.  If  there 
is  a  unique  such  imputation,  then  that  is  the  nucleolus. 

4.  Etc. 

Thus,  the  idea  behind  the  nucleolus  is  to  make  the  largest  dissatisfaction  as  small 
as  possible.  If  there  is  more  than  one  possibility  to  do  this,  then  we  also  make 
the  second  largest  dissatisfaction  as  small  as  possible,  and  so  on,  until  a  unique 
distribution  is  reached. 

A  formal  definition  of  the  nucleolus  can  be  found  in  Chap.  19.  Here  we  content 
ourselves  with  the  given  verbal  description  and  some  examples. 

Our  first  illustration  of  this  procedure  is  its  application  to  the  three  cities  game, 
reproduced  in  Table  9.5.  The  third  line  of  the  table  gives  the  excesses  at  the 
imputation  (70,  70,  80).  The  choice  of  this  particular  imputation  is  arbitrary:  we  use 
it  as  a  starting  point  to  find  the  nucleolus.  The  largest  excess  at  this  imputation  is 
$-30$, namely for the coalition $\{2,3\}$. Clearly, we can decrease this excess by giving
players 2 and 3 more at the expense of player 1. Doing so implies that the excesses
of $\{1,2\}$ or of $\{1,3\}$ or of both will increase. These excesses are equal to $-50$. We
can increase the payoffs of players 2 and 3 by $6\tfrac{2}{3}$ each and decrease the payoff of player
1 by $2 \cdot 6\tfrac{2}{3} = 13\tfrac{1}{3}$, so that these three excesses become equal. Thus we obtain the
imputation $(56\tfrac{2}{3}, 76\tfrac{2}{3}, 86\tfrac{2}{3})$, at which the excesses of the three two-player coalitions
are all equal to $-43\tfrac{1}{3}$, and these are also the maximal excesses. Now first observe
that  at  this  imputation  the  maximal  excess  is  as  small  as  possible.  This  follows  since 


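Table 9.5 Heuristic determination of the nucleolus of the three cities game

| $S$ | $\{1\}$ | $\{2\}$ | $\{3\}$ | $\{1,2\}$ | $\{1,3\}$ | $\{2,3\}$ | $\{1,2,3\}$ |
|---|---|---|---|---|---|---|---|
| $v(S)$ | 0 | 0 | 0 | 90 | 100 | 120 | 220 |
| $e(S, (70, 70, 80))$ | $-70$ | $-70$ | $-80$ | $-50$ | $-50$ | $-30$ | |
| $e(S, (56\tfrac{2}{3}, 76\tfrac{2}{3}, 86\tfrac{2}{3}))$ | $-56\tfrac{2}{3}$ | $-76\tfrac{2}{3}$ | $-86\tfrac{2}{3}$ | $-43\tfrac{1}{3}$ | $-43\tfrac{1}{3}$ | $-43\tfrac{1}{3}$ | |
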
the sum of the excesses of the three two-player coalitions at any imputation must be
the same, namely equal to $-130$, as follows from

$$
\begin{aligned}
e(\{1,2\}, x) + e(\{1,3\}, x) + e(\{2,3\}, x) &= v(\{1,2\}) + v(\{1,3\}) + v(\{2,3\}) - 2(x_1 + x_2 + x_3) \\
&= 310 - 2 \cdot 220 \\
&= -130 .
\end{aligned}
$$


This  implies  that  none  of  these  excesses  can  be  decreased  without  increasing  at  least 
one  other  excess.  Second,  the  imputation  at  which  these  three  excesses  are  equal  is 
unique,  since  the  system 


$$
\begin{aligned}
90 - x_1 - x_2 &= 100 - x_1 - x_3 \\
100 - x_1 - x_3 &= 120 - x_2 - x_3 \\
x_1 + x_2 + x_3 &= 220 \\
x_1, x_2, x_3 &\ge 0
\end{aligned}
$$

has a unique solution, namely $(56\tfrac{2}{3}, 76\tfrac{2}{3}, 86\tfrac{2}{3})$. So this imputation must be
the nucleolus of the three cities game.
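
The arithmetic in this example is easy to check by machine. The following is a minimal sketch, not part of the original text: the names `WORTH`, `excess`, and `pair_excesses` are introduced here for illustration, and the worths are those listed in Table 9.5. It uses exact fractions so that the thirds are not rounded.

```python
from fractions import Fraction
from itertools import combinations

# Worths of the three cities game as listed in Table 9.5.
WORTH = {
    (1,): 0, (2,): 0, (3,): 0,
    (1, 2): 90, (1, 3): 100, (2, 3): 120,
    (1, 2, 3): 220,
}

def excess(coalition, x):
    """e(S, x) = v(S) - x(S), where x maps each player to a payoff."""
    return WORTH[coalition] - sum(x[i] for i in coalition)

def pair_excesses(x):
    """Excesses of {1,2}, {1,3}, {2,3} (in this order) at x."""
    return [excess(pair, x) for pair in combinations((1, 2, 3), 2)]

start = {1: Fraction(70), 2: Fraction(70), 3: Fraction(80)}
# 56 2/3 = 170/3, 76 2/3 = 230/3, 86 2/3 = 260/3
candidate = {1: Fraction(170, 3), 2: Fraction(230, 3), 3: Fraction(260, 3)}

for x in (start, candidate):
    values = pair_excesses(x)
    print([str(e) for e in values], "sum =", sum(values))
```

At both imputations the three two-player excesses sum to $-130$, and at the second one they are all equal to $-130/3 = -43\tfrac{1}{3}$.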

This  example  suggests  that,  at  least  for  a  three  player  TU-game,  it  is  easy  to 
find  the  nucleolus,  namely  simply  by  equating  the  excesses  of  the  three  two-player 
coalitions.  Unfortunately,  this  is  erroneous.  It  works  if  the  worths  of  the  two- 
player  coalitions  are  large  relative  to  the  worths  of  the  single  player  coalitions,  but 
otherwise  it  may  fail  to  result  in  the  nucleolus.  Consider  the  three-player  TU-game 
in Table 9.6, which is identical to the three cities game except that now $v(\{1\}) = 20$.
The third line of the table shows the excesses at $(56\tfrac{2}{3}, 76\tfrac{2}{3}, 86\tfrac{2}{3})$ in this TU-game.
(This vector is still an imputation.) The maximal excess is now $-36\tfrac{2}{3}$ for the single-player
coalition $\{1\}$. Clearly, $(56\tfrac{2}{3}, 76\tfrac{2}{3}, 86\tfrac{2}{3})$ is no longer the nucleolus: the excess
of $\{1\}$ can be decreased by giving player 1 more at the expense of players 2 and/or
3. Suppose we equalize the excesses of $\{1\}$ and $\{2,3\}$ by solving the equation
$20 - x_1 = 120 - x_2 - x_3$. Together with $x_1 + x_2 + x_3 = 220$ this yields $x_1 = 60$
and $x_2 + x_3 = 160$. Trying the imputation $(60, 75, 85)$, obtained by taking away
the same amount $1\tfrac{2}{3}$ from players 2 and 3, yields the excesses in the fourth line of


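Table 9.6 Heuristic determination of the nucleolus in the three cities game with the worth of
coalition $\{1\}$ changed to 20

| $S$ | $\{1\}$ | $\{2\}$ | $\{3\}$ | $\{1,2\}$ | $\{1,3\}$ | $\{2,3\}$ | $\{1,2,3\}$ |
|---|---|---|---|---|---|---|---|
| $v(S)$ | 20 | 0 | 0 | 90 | 100 | 120 | 220 |
| $e(S, (56\tfrac{2}{3}, 76\tfrac{2}{3}, 86\tfrac{2}{3}))$ | $-36\tfrac{2}{3}$ | $-76\tfrac{2}{3}$ | $-86\tfrac{2}{3}$ | $-43\tfrac{1}{3}$ | $-43\tfrac{1}{3}$ | $-43\tfrac{1}{3}$ | |
| $e(S, (60, 75, 85))$ | $-40$ | $-75$ | $-85$ | $-45$ | $-45$ | $-40$ | |
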
Table 9.6. We claim that $(60, 75, 85)$ is the nucleolus of this TU-game. The maximal
excess is $-40$, reached by the coalitions $\{1\}$ and $\{2,3\}$, and this cannot be decreased:
decreasing the excess for one of those two coalitions implies increasing the excess
for the other coalition. Hence, $x_1$ has to be equal to 60 in the nucleolus. The second
maximal excess is $-45$, reached by the coalitions $\{1,2\}$ and $\{1,3\}$. Since $x_1$ has
already been fixed at 60, a decrease in the excess for one of these two coalitions
implies an increase of the excess for the other coalition. Hence, $x_2$ and $x_3$ are also
fixed, at 75 and 85, respectively.
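
The comparison between the two imputations can be sketched in code. The snippet below is an illustration only: it assumes that "largest dissatisfaction first, then second largest, and so on" may be read as a lexicographic comparison of the excesses sorted from large to small, and the names `WORTH` and `sorted_excesses` are hypothetical. The worths are those of Table 9.6.

```python
from fractions import Fraction
from itertools import combinations

PLAYERS = (1, 2, 3)
# Table 9.6: the three cities game with v({1}) raised to 20.
WORTH = {(1,): 20, (2,): 0, (3,): 0, (1, 2): 90, (1, 3): 100, (2, 3): 120}

def sorted_excesses(x):
    """Excesses of all nonempty coalitions other than N, largest first."""
    coalitions = [c for k in (1, 2) for c in combinations(PLAYERS, k)]
    return sorted(
        (WORTH[c] - sum(x[i - 1] for i in c) for c in coalitions),
        reverse=True,
    )

old = (Fraction(170, 3), Fraction(230, 3), Fraction(260, 3))  # (56 2/3, 76 2/3, 86 2/3)
new = (Fraction(60), Fraction(75), Fraction(85))

# Python compares lists lexicographically: first entries first, then second, ...
print(sorted_excesses(new) < sorted_excesses(old))
```

The comparison is decided at the first entry, $-40$ against $-36\tfrac{2}{3}$, so $(60, 75, 85)$ does better. This confirms only that it beats one particular rival; the argument in the text is what shows that no imputation does better.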

These two examples indicate that it may not be easy to compute the nucleolus.
For three-player games the heuristic method above works well. In general, the
nucleolus can be computed by solving a series of linear programs. The following
example illustrates this for the three-player games considered above.

Example  9.6  The  maximal  excess  at  the  nucleolus  of  the  three-cities  game  in 
Table  9.5  can  be  found  by  solving  the  following  linear  minimization  problem. 

Minimize $\alpha$ subject to

$$
\begin{aligned}
x_1 + x_2 + x_3 &= 220 \\
x_1, x_2, x_3 &\ge 0 \\
0 - x_1 &\le \alpha \\
0 - x_2 &\le \alpha \\
0 - x_3 &\le \alpha \\
90 - x_1 - x_2 &\le \alpha \\
100 - x_1 - x_3 &\le \alpha \\
120 - x_2 - x_3 &\le \alpha
\end{aligned}
$$

Here, $\alpha$ is the maximal excess to be minimized. The first two constraints make sure
that $\alpha$ is minimized over the set of imputations. The next three constraints are those
for the excesses of the single-player coalitions, and the last three constraints are
those for the excesses of the two-player coalitions: these six inequalities ensure that
$\alpha$ will be the maximal excess. By using the efficiency constraint $x_1 + x_2 + x_3 =
220$ we can rewrite the problem into a minimization problem with three variables
($\alpha$ and for instance $x_1$, $x_2$) which could be solved graphically (by making a three-dimensional
diagram); in general, however, we can use a computer program, based
on for instance the simplex method, to solve the problem. In this case, from the
preceding analysis we already know that the optimal solution of the problem is
$\alpha = -43\tfrac{1}{3}$, attained at a unique point $(x_1, x_2, x_3) = (56\tfrac{2}{3}, 76\tfrac{2}{3}, 86\tfrac{2}{3})$.
For the modified three-cities game in Table 9.6 the linear program becomes:
Minimize $\alpha$ subject to

$$
\begin{aligned}
x_1 + x_2 + x_3 &= 220 \\
x_1, x_2, x_3 &\ge 0 \\
20 - x_1 &\le \alpha \\
0 - x_2 &\le \alpha \\
0 - x_3 &\le \alpha \\
90 - x_1 - x_2 &\le \alpha \\
100 - x_1 - x_3 &\le \alpha \\
120 - x_2 - x_3 &\le \alpha
\end{aligned}
$$

From the preceding analysis the solution of this problem is $\alpha = -40$, and at all
points at which this value is attained we have $x_1 = 60$ and, consequently, $x_2 + x_3 =
160$. To determine the second maximal excess we need to solve the following linear
program:

Minimize $\alpha$ subject to

$$
\begin{aligned}
x_2 + x_3 &= 160 \\
x_2, x_3 &\ge 0 \\
-x_2 &\le \alpha \\
-x_3 &\le \alpha \\
30 - x_2 &\le \alpha \\
40 - x_3 &\le \alpha
\end{aligned}
$$

The optimal solution to this problem is $\alpha = -45$, attained at $(x_2, x_3) = (75, 85)$;
this follows from the preceding analysis, or from solving the problem graphically
after reducing it to a problem of two variables using the constraint $x_2 + x_3 = 160$.
Thus, the nucleolus of the modified three-cities problem is $(60, 75, 85)$, as found
earlier. □
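
The second linear program is small enough to solve without an LP library. The sketch below is an illustration under stated assumptions, not the method of the text: it substitutes $x_3 = 160 - x_2$, writes $t$ for $x_2$, and minimizes the maximum of the four resulting affine functions of $t$ by trying the endpoints of $[0, 160]$ and all crossing points of two lines. The function name `minimize_max_affine` is hypothetical.

```python
from fractions import Fraction
from itertools import combinations

def minimize_max_affine(lines, lo, hi):
    """Minimize max_i (a_i * t + b_i) over lo <= t <= hi.

    The objective is convex and piecewise linear, so a minimizer lies at an
    endpoint or at a point where two of the lines cross; all are tried.
    """
    candidates = {Fraction(lo), Fraction(hi)}
    for (a1, b1), (a2, b2) in combinations(lines, 2):
        if a1 != a2:  # parallel lines never cross
            t = Fraction(b2 - b1, a1 - a2)
            if lo <= t <= hi:
                candidates.add(t)

    def value(t):
        return max(a * t + b for a, b in lines)

    best = min(candidates, key=value)
    return best, value(best)

# With t = x2 and x3 = 160 - t, each constraint "expression <= alpha"
# becomes a line (slope, intercept) in t:
#   -x2 -> (-1, 0),   -x3 -> (1, -160),   30 - x2 -> (-1, 30),   40 - x3 -> (1, -120)
lines = [(-1, 0), (1, -160), (-1, 30), (1, -120)]
x2, alpha = minimize_max_affine(lines, 0, 160)
print(x2, 160 - x2, alpha)
```

For larger games a general-purpose LP solver is the natural tool; the one-variable reduction works here only because $x_1$ is already fixed at 60.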

Although  the  nucleolus  is  not  easy  to  compute  it  is  an  attractive  solution  for  the 
following  reasons.  It  assigns  a  unique  imputation  to  every  essential  game  and  if  a 
game  has  a  nonempty  core,  the  nucleolus  assigns  a  core  element. 

Proposition 9.7 Let $(N, v)$ be a game with nonempty core. Then the nucleolus of
$(N, v)$ is a payoff distribution in the core.




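Table 9.7 The game in Example 9.8

| $S$ | $\{1\}$ | $\{2\}$ | $\{3\}$ | $\{1,2\}$ | $\{1,3\}$ | $\{2,3\}$ | $\{1,2,3\}$ |
|---|---|---|---|---|---|---|---|
| $v(S)$ | 0 | 0 | 0 | 10 | 0 | 20 | 20 |
| $e(S, (0, 10, 10))$ | 0 | $-10$ | $-10$ | 0 | $-10$ | 0 | |
| $e(S, (0, 15, 5))$ | 0 | $-15$ | $-5$ | $-5$ | $-5$ | 0 | |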

Proof Take any $x$ in the core of $(N, v)$. Then for every nonempty coalition $S \neq N$ we
have $x(S) \ge v(S)$, hence $e(S, x) = v(S) - x(S) \le 0$. Suppose that $z$ is the nucleolus
of $(N, v)$. Since the nucleolus minimizes the maximal excess over all imputations,
and we already have an imputation, namely $x$, at which all excesses and thus also
the maximal excess are non-positive, we must have that the maximal excess at $z$ is
non-positive and, hence, all excesses at $z$ are non-positive. This means that for every
nonempty coalition $S \neq N$ we have $e(S, z) = v(S) - z(S) \le 0$, hence $z(S) \ge v(S)$.
But this means that $z$ is in the core of $(N, v)$. ■

In  view  of  Proposition  9.7,  if  the  core  of  a  game  is  known  then  this  may  be  very 
helpful  in  finding  the  nucleolus  since  we  can  restrict  consideration  to  the  core.  We 
illustrate  this  by  the  next  examples. 

Example  9.8  Consider  the  three-person  game  in  Table  9.7.  This  game  has  a 
nonempty  core,  and  player  1  obtains  0  in  every  core  distribution.  Starting  with  the 
core  distribution  (0, 10, 10),  we  find  that  the  maximal  excess  is  equal  to  0,  reached 
for  the  coalitions  {1},  {1,2},  and  {2,  3}.  Clearly,  this  excess  cannot  be  decreased 
any  further  since  a  decrease  of  the  excess  for  {1}  implies  an  increase  for  {2,  3}, 
and conversely. Hence, if $z$ is the nucleolus, then $z_1 = 0$ and $z_2 + z_3 = 20$. Note,
however, that the excess for $\{1,2\}$ can be decreased by increasing the payoff for
player 2 at the expense of player 3. For the payoff distribution $(0, 15, 5)$, we find
that the second maximal excess is reached by $\{1,2\}$, $\{1,3\}$, and $\{3\}$. We cannot
decrease the excess of $\{1,2\}$ without increasing it for $\{1,3\}$ and $\{3\}$, given that $z_1$ is
already fixed at 0. Hence, we have obtained the nucleolus: $z = (0, 15, 5)$. □
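
The second step of this example can be written out as a one-variable problem. The following is a sketch only; the parameter $t$, standing for the payoff of player 3, is introduced here for illustration and does not appear in the original. With $z_1 = 0$ and $z_2 + z_3 = 20$ fixed, the excesses of $\{1\}$ and $\{2,3\}$ stay at 0, and the remaining excesses depend on $t$ alone:

```latex
\begin{aligned}
z &= (0,\ 20 - t,\ t), \qquad 0 \le t \le 20, \\
e(\{1,2\}, z) &= 10 - (20 - t) = t - 10, \\
e(\{1,3\}, z) &= e(\{3\}, z) = -t, \\
e(\{2\}, z) &= -(20 - t) = t - 20, \\
\max\{\,t - 10,\ -t,\ t - 20\,\} &= \max\{\,t - 10,\ -t\,\},
  \quad\text{minimal when } t - 10 = -t, \text{ i.e. } t = 5 .
\end{aligned}
```

At $t = 5$ the second maximal excess equals $-5$ and $z = (0, 15, 5)$, in line with the last row of Table 9.7.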

Example  9.9  Consider  again  the  general  glove  game  of  Example  9.3.  For  l  >  r, 
the  core  of  this  game  consists  of  the  unique  payoff  distribution  where  the  left-hand 
glove  owners  obtain  0  and  the  right-hand  glove  owners  obtain  1.  By  Proposition  9.7, 
this  is  also  the  nucleolus  of  this  game.  □ 

We  conclude  with  an  example  illustrating  again  the  core,  Shapley  value  and 
nucleolus  of  a  game. 

Example 9.10 Consider the following six-player cooperative game. The player set
is $N = \{1, \ldots, 6\}$. A coalition $S$ has worth 1 in exactly two cases: either it contains
player 1 and at least one other player, or it is the coalition $\{2, \ldots, 6\}$. All other
coalitions have worth 0. (Such a game is called an apex game; player 1 is the apex
player and the other players are called minor players.)

First observe that the core of this game is empty. To see this, suppose $x =
(x_1, \ldots, x_6)$ is in the core. Then $x_1 + \cdots + x_5 \ge v(\{1, \ldots, 5\}) = 1$. Since
$x_6 \ge v(\{6\}) = 0$ and $x_1 + \cdots + x_6 = v(N) = 1$, we must have $x_1 + \cdots + x_5 = 1$
and $x_6 = 0$. Similarly, one derives $x_2 = \cdots = x_5 = 0$. Hence, $x_2 + \cdots + x_6 = 0$,
but then $1 = v(\{2, \ldots, 6\}) > x_2 + \cdots + x_6$, so that the core constraint for coalition
$\{2, \ldots, 6\}$ is violated. From this contradiction, we conclude that the core of this
game is empty.

To find the nucleolus, for reasons of symmetry we may assume that it is of the
form $(1 - 5\alpha, \alpha, \ldots, \alpha)$, where $0 \le \alpha \le 1/5$ in order to make it an imputation.
The excess of a coalition of the form $\{1, i\}$ for every $i \in \{2, \ldots, 6\}$ is equal to
$1 - (1 - 5\alpha) - \alpha = 4\alpha$, and the excess of the coalition $\{2, \ldots, 6\}$ is $1 - 5\alpha$.
Clearly, these will be the maximal excesses. By increasing $\alpha$ the excesses of the
$\{1, i\}$ coalitions will increase and the excess of the coalition $\{2, \ldots, 6\}$ will decrease;
by decreasing $\alpha$ the effects will be opposite. Therefore, we find the nucleolus by
equating these excesses. Setting $4\alpha = 1 - 5\alpha$ yields $\alpha = 1/9$, hence the nucleolus
is $(4/9, 1/9, \ldots, 1/9)$.
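
The claim that the coalitions $\{1, i\}$ and $\{2, \ldots, 6\}$ carry the maximal excess can be checked by brute force over all 62 nonempty coalitions other than $N$. The following is a minimal sketch; the names `worth` and `max_excess` and the grid of trial values for $\alpha$ are assumptions made for this illustration.

```python
from fractions import Fraction
from itertools import combinations

PLAYERS = range(1, 7)
MINORS = frozenset(range(2, 7))

def worth(coalition):
    """Apex game: worth 1 if player 1 has a partner, or if all minor players join."""
    s = frozenset(coalition)
    return 1 if (1 in s and len(s) >= 2) or s == MINORS else 0

def max_excess(alpha):
    """Largest excess over nonempty coalitions S != N at (1 - 5a, a, ..., a)."""
    x = {1: 1 - 5 * alpha, **{i: alpha for i in MINORS}}
    return max(
        worth(c) - sum(x[i] for i in c)
        for k in range(1, 6)
        for c in combinations(PLAYERS, k)
    )

# Trial values alpha = 0, 1/90, ..., 18/90 = 1/5; the grid contains 1/9 = 10/90.
alphas = [Fraction(k, 90) for k in range(19)]
best = min(alphas, key=max_excess)
print(best, max_excess(best))
```

On this grid the maximal excess is smallest at $\alpha = 1/9$, where it equals $4/9$. The search covers only symmetric imputations, as in the text.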

To  find  the  Shapley  value,  we  use  Definition  9.4.  (There  are  6!  =  720  possible 
orderings  of  the  six  players,  so  listing  all  of  them  is  an  inefficient  method  to  compute 
the  Shapley  value.)  It  is  sufficient  to  compute  the  Shapley  value  for  player  2  (for 
instance)  since  then  we  also  know  the  Shapley  value  for  the  other  players.  Player  2 
makes  a  nonzero  contribution,  equal  to  1,  in  exactly  two  cases:  either  to  S  =  {1}  or 
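to $S = \{3, \ldots, 6\}$. So we obtain

$$
\Phi_2(N, v) = \frac{1!\,(6-1-1)!}{6!} + \frac{4!\,(6-4-1)!}{6!} = \frac{1}{15} .
$$

Thus, the Shapley value of this game is

$$
\Phi(N, v) = \left(1 - 5 \cdot \tfrac{1}{15},\ \tfrac{1}{15},\ \ldots,\ \tfrac{1}{15}\right)
           = \left(\tfrac{2}{3},\ \tfrac{1}{15},\ \ldots,\ \tfrac{1}{15}\right) .
$$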

□ 
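
The same numbers can be obtained by evaluating the weighted sum of marginal contributions directly. The sketch below assumes that the formula of Definition 9.4 is the usual one, with weight $|S|!\,(n-|S|-1)!/n!$ on the contribution $v(S \cup \{i\}) - v(S)$, as suggested by the two terms above. The function names are hypothetical.

```python
from fractions import Fraction
from itertools import combinations
from math import factorial

N = tuple(range(1, 7))
MINORS = frozenset(range(2, 7))

def worth(coalition):
    """Apex game of Example 9.10."""
    s = frozenset(coalition)
    return 1 if (1 in s and len(s) >= 2) or s == MINORS else 0

def shapley(player):
    """Sum over all S without `player` of weight(|S|) * (v(S + player) - v(S))."""
    n = len(N)
    others = [j for j in N if j != player]
    total = Fraction(0)
    for k in range(n):  # k = |S| runs from 0 to n - 1
        weight = Fraction(factorial(k) * factorial(n - k - 1), factorial(n))
        for s in combinations(others, k):
            total += weight * (worth(s + (player,)) - worth(s))
    return total

phi = [shapley(i) for i in N]
print([str(p) for p in phi])
```

This loops over $2^5 = 32$ coalitions per player rather than $6! = 720$ orderings.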


9.5  Problems 

9.1.  Number  of  Coalitions 

Show that a set of $n \in \mathbb{N}$ elements has $2^n$ different subsets.

9.2.  Computing  the  Core 

(a)  Compute  the  core  of  the  glove  game  of  Table  9.2  by  making  a  diagram. 

(b)  Compute  the  core  of  the  dentist  game  of  Table  9.3  by  making  a  diagram. 

(c)  Compute  the  core  of  the  UN  security  council  voting  game  in  Sect.  9.1. 
9.3. The Core of a Two-Person Game

Consider the two-person game $(\{1,2\}, v)$ given by $v(\{1\}) = a$, $v(\{2\}) = b$, and
$v(\{1,2\}) = c$, where $a, b, c \in \mathbb{R}$. Give a necessary and sufficient condition on $a$, $b$,
and $c$ for the core of $(\{1,2\}, v)$ to be nonempty. Make a diagram and compute the
core.

9.4.  The  Core  of  the  General  Glove  Game 

Compute  the  core  of  the  general  glove  game  in  Example  9.3  for  the  case  where  the 
numbers  of  left-hand  glove  owners  and  right-hand  glove  owners  are  equal.  Is  the 
Shapley  value  (cf.  Example  9.5)  in  the  core? 

9.5.  A  Condition  for  Nonemptiness  of  the  Core  of  a  Three-Person  Game 

Let $(\{1,2,3\}, v)$ be a three-person game which has a nonempty core. Show that
$2v(\{1,2,3\}) \ge v(\{1,2\}) + v(\{1,3\}) + v(\{2,3\})$. (Hint: Take a core element $x =
(x_1, x_2, x_3)$ and write down the core constraints.)

9.6.  Non-monotonicity  of  the  Core 

Consider the following four-person game: $v(\{i\}) = 0$ for every $i = 1, \ldots, 4$,
$v(\{1,2\}) = v(\{3,4\}) = 0$, $v(S) = 1$ for all other two-person coalitions and for
all three-person coalitions, and $v(N) = 2$.

(a) Show that $C(N, v) = \{(\alpha, \alpha, 1 - \alpha, 1 - \alpha) \in \mathbb{R}^4 \mid 0 \le \alpha \le 1\}$.

(b) Consider the game $(N, v')$ equal to $(N, v)$ except for $v'(\{1,3,4\}) = 2$. Show
that the core of $(N, v')$ consists of a single element. What about the payoff to
player 1 if core elements in $(N, v)$ and $(N, v')$ are compared? Conclude that the
core is not “monotonic” (consider player 1).

9.7.  Efficiency  of  the  Shapley  Value 

Let $(N, v)$ be an arbitrary TU-game. Show that the Shapley value $\Phi(N, v)$ is
efficient. [Hint: take an order $i_1, i_2, \ldots, i_n$ of the players and show that the sum
of the coordinates of the corresponding marginal vector is equal to $v(N)$; use this to
conclude that $\Phi(N, v)$ is efficient.]

9.8.  Computing  the  Shapley  Value 

(a)  Compute  the  Shapley  value  of  the  glove  game  of  Table  9.2.  Is  it  an  element  of 
the  core? 

(b)  Compute  the  Shapley  value  of  the  dentist  game  of  Table  9.3.  Is  it  an  element  of 
the  core? 

(c)  Compute  the  Shapley  value  of  the  UN  security  council  voting  game  in  Sect.  9.1. 
(Hint:  observe  the — more  or  less — obvious  fact  that  the  Shapley  value  assigns 
the  same  payoff  to  all  permanent  members  and  also  to  all  nonpermanent 
members.  Use  the  formula  in  Definition  9.4.)  Is  it  an  element  of  the  core? 
9.9. The Shapley Value and the Core

For every real number $a$ the three-player TU-game $v_a$ is given by: $v_a(\{i\}) = 0$ for
$i = 1, 2, 3$, $v_a(\{1,2\}) = 3$, $v_a(\{1,3\}) = 2$, $v_a(\{2,3\}) = 1$, $v_a(\{1,2,3\}) = a$.

(a) Determine the minimal value of $a$ so that the TU-game $v_a$ has a nonempty core.

(b) Calculate the Shapley value of $v_a$ for $a = 6$.

(c) Determine the minimal value of $a$ so that the Shapley value of $v_a$ is a core
distribution.

9.10.  Shapley  Value  in  a  Two-Player  Game 

Let $(N, v)$ be a two-player TU-game, i.e., $N = \{1, 2\}$. Compute the Shapley value
(expressed in $v(\{1\})$, $v(\{2\})$, and $v(\{1,2\})$), and show that it is in the core of the
game provided the core is nonempty. Make a diagram.

9.11.  Computing  the  Nucleolus 

(a)  Compute  the  nucleolus  of  the  glove  game  of  Table  9.2. 

(b)  Compute  the  nucleolus  of  the  dentist  game  of  Table  9.3. 

(c)  Compute  the  nucleolus  of  the  UN  security  council  voting  game  in  Sect.  9.1. 
(Hint:  use  Proposition 9.7.) 

(d) Compute the nucleolus of the games $(N, v)$ and $(N, v')$ in Problem 9.6.

9.12.  Nucleolus  of  Two-Player  Games 

Let $(N, v)$ be an essential two-player TU-game. Compute the nucleolus.

9.13.  Computing  the  Core,  the  Shapley  Value,  and  the  Nucleolus 

(a) Compute the Shapley value and the nucleolus in the three-player TU-game given
by: $v(\{i\}) = 1$ for $i = 1, 2, 3$, $v(\{1,2\}) = 2$, $v(\{1,3\}) = 3$, $v(\{2,3\}) = 4$,
$v(\{1,2,3\}) = 6$. Is the Shapley value a core element in this game?

(b) Compute the core of this game. Make a picture.

(c) Suppose we increase $v(\{1\})$. What is the maximal value of $v(\{1\})$ such that the
game still has a nonempty core?

9.14.  Voting  (1) 

Suppose in Parliament there are four parties A, B, C, D with numbers of votes equal
to 40, 30, 20, 10, respectively. To pass any law a two-thirds majority is needed.

(a)  Formulate  this  situation  as  a  four-person  cooperative  game  where  winning 
coalitions  have  worth  1  and  losing  coalitions  worth  0.  Determine  the  Shapley 
value  of  this  game. 

(b)  Determine  also  the  core  and  the  nucleolus  of  this  game. 
9.15. Voting (2)

A  voting  committee  consists  of  five  members:  members  1  and  2  belong  to  party  I, 
while  members  3,  4,  and  5  belong  to  party  II.  In  order  to  pass  a  decision  at  least 
a  weak  majority  of  each  party  (that  is,  at  least  50  %  of  the  votes  of  each  party)  is 
required.  A  coalition  that  has  a  weak  majority  of  both  parties  is  called  winning.  We 
model  this  situation  as  a  so-called  simple  game :  winning  coalitions  obtain  worth  1, 
all  other  coalitions  worth  0. 

(a)  Show  that  there  are  six  winning  coalitions  of  minimal  size:  list  all  of  them. 

(b)  Use  your  answer  to  (a)  to  give  a  concise  description  of  the  game  (i.e.,  without 
listing  all  32  coalitions). 

(c)  Compute  the  Shapley  value  of  this  game.  According  to  the  Shapley  value,  which 
players  (members)  are  most  powerful? 

(d)  Compute  the  nucleolus  of  this  game.  According  to  the  nucleolus,  which  players 
(members)  are  most  powerful? 

(e)  Compute  the  core  of  this  game. 

9.16.  Two  Buyers  and  a  Seller 

Players  1  and  2  are  buyers,  while  player  3  is  a  seller.  The  seller  owns  an  object  that 
is  worth  nothing  to  him,  but  has  value  1  for  buyer  1  and  value  2  for  buyer  2.  These 
are  the  prices  that  the  buyers  are  willing  to  pay  to  the  seller  in  order  to  get  the  object. 

(a)  Model  this  situation  as  a  three-person  TU-game,  where  the  worth  of  each 
coalition  is  the  maximal  surplus  it  can  create  by  a  transaction  between  a  buyer 
and  the  seller. 

(b)  Compute  the  core  of  this  game. 

(c)  Compute  the  Shapley  value  of  this  game.  Is  it  in  the  core? 

(d)  Compute  the  nucleolus  of  this  game. 

9.17.  Properties  of  the  Shapley  Value 

The  properties  of  the  Shapley  value  described  in  (a)-(c)  below  are  called  symmetry, 
additivity,  and  dummy  property,  respectively.  It  can  be  shown  (Chap.  17)  that 
the  Shapley  value  is  the  unique  solution  concept  that  assigns  exactly  one  payoff 
vector  to  each  TU-game  and  has  these  three  properties  together  with  efficiency  (cf. 
Problem  9.7).  In  other  words,  a  solution  concept  has  these  four  properties  if,  and 
only  if,  it  is  the  Shapley  value.  In  this  exercise  you  are  asked  to  show  the  “easy”  part 
of  this  statement,  namely  the  if-part.  (Hint:  in  each  case,  decide  which  of  the  two 
formulas  for  the  Shapley  value — the  one  in  Definition  9.4  or  the  formula  based  on 
marginal  vectors — is  most  convenient  to  use.) 

(a) Let $(N, v)$ be a TU-game, and suppose players $i$ and $j$ are symmetric in this
game, i.e., $v(S \cup \{i\}) = v(S \cup \{j\})$ for all coalitions $S$ which do not contain $i$
and $j$. Show that $i$ and $j$ obtain the same payoff from the Shapley value.

(b) Let $(N, v)$ and $(N, w)$ be two TU-games with the same player set $N$. Define the
sum of these TU-games as the TU-game with player set $N$ where the worth of
each coalition $S$ is given by $v(S) + w(S)$; denote this TU-game by $(N, v + w)$.
Show that the Shapley value assigns to this sum TU-game the payoff vector
which is the sum of the Shapley values of $(N, v)$ and $(N, w)$.

(c) Call player $i$ a dummy in the TU-game $(N, v)$ if $v(S \cup \{i\}) = v(S) + v(\{i\})$ for
every coalition $S$ to which player $i$ does not belong. Show that the Shapley value
assigns exactly the payoff $v(\{i\})$ to player $i$.


9.6  Notes 

For  a  more  advanced  treatment  of  cooperative  game  theory,  see  Chaps.  16-20. 
Games  with  nonempty  cores  were  characterized  in  Bondareva  (1962)  and  Shapley 
(1967),  see  Chap.  16. 

The  Shapley  value  was  introduced  and  axiomatically  characterized  in  Shapley 
(1953) — see  Chap.  17. 

Imputations  were  first  introduced  by  von  Neumann  and  Morgenstern 
(1944/1947).  They  introduced  cooperative  games  in  order  to  cope  with  multi-person 
zero-sum  games,  which  in  general  do  not  have  a  value. 

The  nucleolus  was  introduced  in  Schmeidler  (1969).  The  nucleolus  is  similar  in 
spirit  to  the  main  principle  of  distributive  justice  proposed  in  Rawls  (1971),  namely 
to  maximize  the  lot  of  the  worst  off  people  in  society. 

The  arguments  used  above  to  show  that  a  particular  imputation  is  indeed  the 
nucleolus  implicitly  use  a  general  property  of  the  nucleolus  called  the  Kohlberg 
criterion.  See  Chap.  19  for  a  detailed  study  of  the  nucleolus. 

Problem  9.6  is  taken  from  Moulin  (1988). 
The common features of a cooperative game model (such as the model of a
game with transferable utility in Chap. 9) include: the abstraction from a detailed
description  of  the  strategic  possibilities  of  a  player;  instead,  a  detailed  description 
of  what  players  and  coalitions  can  attain  in  terms  of  outcomes  or  utilities;  solution 
concepts  based  on  strategic  considerations  and/or  considerations  of  fairness,  equity, 
efficiency,  etc.;  if  possible,  an  axiomatic  characterization  of  such  solution  concepts. 
For  instance,  one  can  argue  that  the  core  for  TU-games  is  based  on  strategic 
considerations  whereas  the  Shapley  value  is  based  on  a  combination  of  efficiency 
and  symmetry  or  fairness  with  respect  to  contributions.  The  latter  is  made  precise 
by  an  axiomatic  characterization  as  in  Problem  9.17. 

In  this  chapter  a  few  other  cooperative  game  models  are  discussed:  bargaining 
problems  in  Sect.  10.1,  exchange  economies  in  Sect.  10.2,  matching  problems  in 
Sect.  10.3,  and  house  exchange  in  Sect.  10.4. 


10.1  Bargaining  Problems 

An  example  of  a  bargaining  problem  is  the  division  problem  in  Sect.  1.3.5.  A 
noncooperative,  strategic  approach  to  such  a  bargaining  problem  can  be  found  in 
Sect.  6.7,  see  also  Problems  6.16  and  6.17.  In  this  section  we  treat  the  bargaining 
problem  from  a  cooperative,  axiomatic  perspective.  Surprisingly,  there  is  a  close 
relation  between  this  approach  and  the  strategic  approach,  as  we  will  see.  In 
Sect.  10.1.1  we  discuss  the  Nash  bargaining  solution  and  in  Sect.  10.1.2  its  relation 
with  the  Rubinstein  bargaining  procedure  of  Sect.  6.7. 
10.1.1 The Nash Bargaining Solution

We  start  with  the  definition  of  a  two-person  bargaining  problem. 

Definition 10.1 A two-person bargaining problem is a pair $(S, d)$, where

(i) $S \subset \mathbb{R}^2$ is a convex, closed and bounded set,²

(ii) $d = (d_1, d_2) \in S$ such that there is some point $x = (x_1, x_2) \in S$ with $x_1 > d_1$
and $x_2 > d_2$.

$S$ is the feasible set and $d$ is the disagreement point. □

The interpretation of a bargaining problem $(S, d)$ is as follows. The two players
bargain over the feasible outcomes in $S$. If they reach an agreement $x = (x_1, x_2) \in S$,
then player 1 receives utility $x_1$ and player 2 receives utility $x_2$. If they do not reach
an agreement, then the game ends in the disagreement point $d$, yielding utility $d_1$
to player 1 and $d_2$ to player 2. This is an interpretation, and the actual bargaining
procedure is not spelled out.

For  the  example  in  Sect.  1.3.5,  the  feasible  set  and  the  disagreement  point  are 
given  by 


$$
S = \{x \in \mathbb{R}^2 \mid 0 \le x_1, x_2 \le 1,\ x_2 \le \sqrt{1 - x_1}\}, \qquad d_1 = d_2 = 0 .
$$


See  also  Fig.  1.7.  In  general,  a  bargaining  problem  may  look  as  in  Fig.  10.1.  The  set 
of  all  such  bargaining  problems  is  denoted  by  B. 


Fig.  10.1  A  two-person 
bargaining  problem 


1 We restrict attention here to two-person bargaining problems. For n-person bargaining problems
and, more generally, NTU-games, see the Notes section at the end of the chapter and Chap. 21.

2 A subset of $\mathbb{R}^k$ is convex if with each pair of points in the set also the line segment connecting these
points is in the set. A set is closed if it contains its boundary or, equivalently, if for every sequence
of points in the set that converges to a point that limit point is also in the set. It is bounded if there
is a number $M > 0$ such that $|x_i| \le M$ for all points $x$ in the set and all coordinates $i$.
Fig. 10.2 Illustration of the four conditions (‘axioms’) determining the Nash bargaining
solution, cf. Theorem 10.2. In (a) the Pareto optimal subset of $S$ is the thick black curve. The
bargaining problem $(S, d)$ in (b) is symmetric, and symmetry of $F$ means that $F$ should assign a
point on the thick black line segment. In (c), which illustrates scale covariance, we took $d$ to be the
origin, and $T$ results from $S$ by multiplying all first coordinates by 2: then scale covariance implies
that $F_1(T, d) = 2 F_1(S, d)$. The independence of irrelevant alternatives axiom is illustrated in (d)


We consider the following question: for any given bargaining problem $(S, d)$,
what is a good compromise? We answer this question by looking for a map
$F : B \to \mathbb{R}^2$ which assigns a feasible point to every bargaining problem,
i.e., satisfies $F(S, d) \in S$ for every $(S, d) \in B$. Such a map is called a (two-person)
bargaining solution. According to Nash (1950), a bargaining solution should
satisfy four conditions, namely: Pareto optimality, symmetry, scale covariance, and
independence of irrelevant alternatives. We discuss each of these conditions in
detail. The conditions are illustrated in Fig. 10.2a-d.

For a bargaining problem $(S, d) \in B$, the Pareto optimal points of $S$ are those
where the utility of no player can be increased without decreasing the utility of the
other player. Formally,

$$
P(S) = \{x \in S \mid \text{for all } y \in S \text{ with } y_1 \ge x_1,\ y_2 \ge x_2, \text{ we have } y = x\}
$$

is the Pareto optimal (sub)set of $S$. The bargaining solution $F$ is Pareto optimal if
$F(S, d) \in P(S)$ for all $(S, d) \in B$. Hence, a Pareto optimal bargaining solution
assigns a Pareto optimal point to each bargaining problem. See Fig. 10.2a for an
illustration.
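
The definition of $P(S)$ can be tried out on a finite sample of the wine-division set. The sketch below is illustrative only: it assumes the feasible set $0 \le x_1, x_2 \le 1$, $x_2 \le \sqrt{1 - x_1}$ from the example above, tests membership through the equivalent condition $x_2^2 \le 1 - x_1$ so that exact fractions can be used, and checks domination only against points of a grid. A point that passes the grid test is therefore not proven Pareto optimal in all of $S$. The names `feasible`, `GRID`, and `pareto_optimal_in_grid` are hypothetical.

```python
from fractions import Fraction

def feasible(x1, x2):
    """Membership in S: 0 <= x1, x2 <= 1 and x2 <= sqrt(1 - x1)."""
    return 0 <= x1 <= 1 and 0 <= x2 <= 1 and x2 * x2 <= 1 - x1

STEP = Fraction(1, 20)
GRID = [
    (i * STEP, j * STEP)
    for i in range(21)
    for j in range(21)
    if feasible(i * STEP, j * STEP)
]

def pareto_optimal_in_grid(x):
    """True if no other grid point is at least as good for both players."""
    return not any(y != x and y[0] >= x[0] and y[1] >= x[1] for y in GRID)

on_boundary = (Fraction(3, 4), Fraction(1, 2))  # satisfies x2 = sqrt(1 - x1)
interior = (Fraction(1, 2), Fraction(1, 2))

print(pareto_optimal_in_grid(on_boundary))
print(pareto_optimal_in_grid(interior))
```

The boundary point passes the test, while the interior point is dominated, for instance by $(3/4, 1/2)$.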
A bargaining problem $(S, d) \in B$ is symmetric if $d_1 = d_2$ and if $S$ is symmetric
with respect to the 45°-line through $d$, i.e., if

$$
S = \{(x_2, x_1) \in \mathbb{R}^2 \mid (x_1, x_2) \in S\} .
$$

In a symmetric bargaining problem there is no way to distinguish between the
players other than by the arbitrary choice of axes. A bargaining solution is
symmetric if $F_1(S, d) = F_2(S, d)$ for each symmetric bargaining problem $(S, d) \in
B$. Hence, a symmetric bargaining solution assigns the same utility to each player in
a symmetric bargaining problem. See Fig. 10.2b.

Observe that, for a symmetric bargaining problem $(S, d)$, Pareto optimality and
symmetry of $F$ would completely determine the solution point $F(S, d)$, since there
is a unique symmetric Pareto optimal point in $S$.

The condition of scale covariance says that a bargaining solution should not
depend on the choice of the origin or on a positive multiplicative factor in the
utilities. For instance, in the wine division problem in Sect. 1.3.5, it should not
matter if the utility functions were $\tilde{u}_1(\alpha) = a_1 \alpha + b_1$ and $\tilde{u}_2(\alpha) = a_2 \sqrt{\alpha} + b_2$,
where $a_1, a_2, b_1, b_2 \in \mathbb{R}$ with $a_1, a_2 > 0$. Saying that this should not matter means
that the final outcome of the bargaining problem, the division of the wine, should
not depend on this. One can think of $\tilde{u}_1, \tilde{u}_2$ as expressing the same preferences about
wine as $u_1, u_2$ in different units.³ Formally, a bargaining solution $F$ is scale covariant
if for all $(S, d) \in B$ and all $a_1, a_2, b_1, b_2 \in \mathbb{R}$ with $a_1, a_2 > 0$ we have:

$$
\begin{aligned}
F\bigl(\{(a_1 x_1 + b_1,\ a_2 x_2 + b_2) \in \mathbb{R}^2 \mid (x_1, x_2) \in S\},\ (a_1 d_1 + b_1,\ a_2 d_2 + b_2)\bigr) \\
= \bigl(a_1 F_1(S, d) + b_1,\ a_2 F_2(S, d) + b_2\bigr) .
\end{aligned}
$$


For  a  simple  case,  this  condition  is  illustrated  in  Fig.  10.2c. 

The final condition is regarded as the most controversial one. Consider a bargaining problem $(S, d)$ with solution outcome $z = F(S, d) \in S$. In a sense, $z$ can be regarded as the best compromise in $S$ according to $F$. Now consider a smaller bargaining problem $(T, d)$ with $T \subseteq S$ and $z \in T$. Since $z$ was the best compromise in $S$, it should certainly be regarded as the best compromise in $T$: $z$ is available in $T$ and every point of $T$ is also available in $S$. Thus, we should conclude that $F(T, d) = z = F(S, d)$. As a less abstract example, suppose that in the wine division problem the wine is split fifty-fifty, with utilities $(1/2, \sqrt{1/2})$. Suppose now that no player wants to drink more than 3/4 liter of wine: more wine does not increase utility. In that case, the new feasible set is

$$T = \{x \in \mathbb{R}^2 \mid 0 \le x_1 \le 3/4,\ 0 \le x_2 \le \sqrt{3/4},\ x_2 \le \sqrt{1 - x_1}\}.$$


3 The  usual  assumption  is  that  the  utility  functions  are  expected  utility  functions,  which  uniquely 
represent  preferences  up  to  choice  of  origin  and  scale. 
According to the argument above, the wine should still be split fifty-fifty: $T \subseteq S$ and $(1/2, \sqrt{1/2}) \in T$. This may seem reasonable, but it is not hard to change the example in such a way that the argument is, at the least, debatable. For instance, suppose that player 1 still wants to drink as much as possible but player 2 does not want to drink more than 1/2 liter. In that case, the feasible set becomes

$$T = \{x \in \mathbb{R}^2 \mid 0 \le x_1 \le 1,\ 0 \le x_2 \le \sqrt{1/2},\ x_2 \le \sqrt{1 - x_1}\},$$

and we would still split the wine fifty-fifty. In this case player 2 would obtain his maximal feasible utility, and $(1/2, \sqrt{1/2})$ no longer seems a reasonable compromise since only player 1 makes a concession.

Formally, a bargaining solution $F$ is independent of irrelevant alternatives if for all $(S, d), (T, d) \in B$ with $T \subseteq S$ and $F(S, d) \in T$, we have $F(T, d) = F(S, d)$. See Fig. 10.2d for an illustration.

The theorem below says that these four conditions determine a unique bargaining solution $F^{\mathrm{Nash}}$, defined as follows. For $(S, d) \in B$, $F^{\mathrm{Nash}}(S, d)$ is equal to the unique point $z \in S$ with $z_i \ge d_i$ for $i = 1, 2$ and such that

$$(z_1 - d_1)(z_2 - d_2) \ge (x_1 - d_1)(x_2 - d_2) \quad \text{for all } x \in S \text{ with } x_i \ge d_i,\ i = 1, 2.$$


The  solution  FNash  is  called  the  Nash  bargaining  solution.  The  result  of  Nash  is  as 
follows. 


Theorem 10.2 The Nash bargaining solution $F^{\mathrm{Nash}}$ is the unique bargaining solution which is Pareto optimal, symmetric, scale covariant, and independent of irrelevant alternatives.


For  a  proof  of  this  theorem  and  the  fact  that  FNash  is  well  defined — i.e.,  the  point  z 
above  exists  and  is  unique — see  Chap.  21. 


Example 10.3 In this example we illustrate the role of the conditions in Theorem 10.2. In fact, we show how the proof of this theorem (cf. Chap. 21) works in an example. Consider the bargaining problem $(S, d)$, where $S = \{(x_1, x_2) \in \mathbb{R}^2 \mid 0 \le x_1 \le 2,\ 0 \le x_2 \le 4 - x_1^2\}$ and $d = (0, 1)$. The Nash bargaining solution outcome is obtained by solving the problem $\max_{0 \le x_1 \le 2} x_1 (3 - x_1^2)$, which yields the point $(1, 3)$. Alternatively, consider the bargaining problem $(S', d')$, obtained by subtracting 1 from the second coordinates of the points in $S$, including $d$, yielding

$$S' = \{(x_1, x_2) \in \mathbb{R}^2 \mid 0 \le x_1 \le 2,\ -1 \le x_2 \le 3 - x_1^2\}, \quad d' = (0, 0).$$

Next, consider the bargaining problem $(S'', d'')$, obtained from $(S', d')$ by dividing all second coordinates by 2, yielding

$$S'' = \{(x_1, x_2) \in \mathbb{R}^2 \mid 0 \le x_1 \le 2,\ -\tfrac{1}{2} \le x_2 \le \tfrac{3}{2} - \tfrac{1}{2} x_1^2\}, \quad d'' = (0, 0).$$
Fig. 10.3 Illustrating Example 10.3

The Pareto optimal boundary of $S''$ is described by the function $f(x_1) = \tfrac{3}{2} - \tfrac{1}{2} x_1^2$ for $0 \le x_1 \le 2$. At $x_1 = 1$ the derivative of this function is equal to $-1$, so that the straight line through the point $(1, 1)$ with slope $-1$ is tangential to $S''$, i.e., the set $S''$ is below this line. Now consider the bargaining problem $(T, (0, 0))$ with $T$ the triangle (including its interior) with vertices $(-1, -1)$, $(3, -1)$, and $(-1, 3)$; see Fig. 10.3. The bargaining problem $(T, (0, 0))$ is symmetric, so that by symmetry and Pareto optimality of the Nash bargaining solution, the outcome is the point $(1, 1)$. This point is also in $S''$, and moreover, $S''$ is a subset of $T$, so that by independence of irrelevant alternatives the point $(1, 1)$ is also the Nash bargaining solution outcome of $(S'', (0, 0))$. By scale covariance, this implies that the Nash bargaining solution outcome of $(S', (0, 0))$ is the point $(1, 2)$. Again by scale covariance, we obtain that the Nash bargaining solution outcome of $(S, (0, 1))$ is the point $(1, 3)$. We have reached this result by using only the properties of the Nash bargaining solution in Theorem 10.2, and not the formula. Observe that the result is in accordance with what we established by direct computation. □
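The direct computation in Example 10.3 can also be checked numerically. The following is a minimal sketch, not part of the original text: the helper name `nash_on_frontier` and the grid resolution are illustrative assumptions, and a plain grid search along the Pareto frontier stands in for the analytical maximization of the Nash product.

```python
def nash_on_frontier(frontier, d, lo, hi, steps=200_000):
    """Grid search for the frontier point that maximizes the Nash product.

    frontier: function x1 -> largest feasible x2 (hypothetical helper argument)
    d:        disagreement point (d1, d2)
    """
    best_x1, best_product = lo, float("-inf")
    for k in range(steps + 1):
        x1 = lo + (hi - lo) * k / steps
        x2 = frontier(x1)
        if x1 < d[0] or x2 < d[1]:
            continue  # only points that weakly dominate d are considered
        product = (x1 - d[0]) * (x2 - d[1])
        if product > best_product:
            best_x1, best_product = x1, product
    return best_x1, frontier(best_x1)


# (S, d) with frontier x2 = 4 - x1^2 and d = (0, 1)
z = nash_on_frontier(lambda x1: 4 - x1**2, (0, 1), 0.0, 2.0)
# (S'', d'') with frontier x2 = 3/2 - x1^2 / 2 and d'' = (0, 0)
z2 = nash_on_frontier(lambda x1: 1.5 - 0.5 * x1**2, (0, 0), 0.0, 2.0)

print(z)                       # approximately (1.0, 3.0)
print(z2)                      # approximately (1.0, 1.0)
print((z2[0], 2 * z2[1] + 1))  # undoing the rescaling recovers z
```

The last line illustrates scale covariance: multiplying the second coordinate of the outcome for $(S'', d'')$ by 2 and adding 1 gives the outcome for $(S, d)$.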


10.1.2 Relation with the Rubinstein Bargaining Procedure

In  the  Rubinstein  bargaining  procedure  the  players  make  alternating  offers.  See 
Sect. 6.7.2 for a detailed discussion of this noncooperative game, and Problem 6.17(d)
for the application to the wine division problem of Sect. 1.3.5. Here,
we  use  this  example  to  illustrate  the  relation  with  the  Nash  bargaining  solution. 

The Nash bargaining solution assigns to this bargaining problem the point $z = (2/3, \sqrt{1/3})$. This means that player 1 obtains 2/3 of the wine and player 2 obtains 1/3. In the Rubinstein infinite horizon bargaining game with discount factor $0 < \delta < 1$ the players make proposals $x = (x_1, x_2) \in P(S)$ and $y = (y_1, y_2) \in P(S)$ such that

$$x_2 = \delta y_2, \quad y_1 = \delta x_1, \tag{10.1}$$
and these proposals are accepted in (subgame perfect) equilibrium. From (10.1) we derive

$$\sqrt{1 - x_1} = x_2 = \delta y_2 = \delta \sqrt{1 - y_1} = \delta \sqrt{1 - \delta x_1},$$

hence $1 - x_1 = \delta^2 (1 - \delta x_1)$, which implies

$$x_1 = \frac{1 - \delta^2}{1 - \delta^3} = \frac{1 + \delta}{1 + \delta + \delta^2}.$$

If we let $\delta$ increase to 1, we obtain

$$\lim_{\delta \to 1} \frac{1 + \delta}{1 + \delta + \delta^2} = \frac{2}{3},$$

which coincides with the payoff to player 1 according to the Nash bargaining solution. This is not a coincidence. It follows, in general, from (10.1) that

$$(x_1 - d_1)(x_2 - d_2) = x_1 x_2 = (y_1/\delta)(\delta y_2) = y_1 y_2 = (y_1 - d_1)(y_2 - d_2),$$

hence the Rubinstein proposals $x$ and $y$ have the same 'Nash product', i.e., they lie on the same level curve of the function $x_1 x_2$. For our wine division example, the proposals $x$ and $y$ for $\delta = 0.5$ are represented in Fig. 10.4.

As $\delta$ increases to 1, this level curve shifts up until it passes through the point $z = F^{\mathrm{Nash}}(S, d)$, since this point maximizes the product $x_1 x_2$ on the set $S$: see again Fig. 10.4.


Fig. 10.4 The wine division problem. The disagreement point is the origin, and $z$ is the Nash bargaining solution outcome. The points $x$ and $y$ are the proposals of players 1 and 2, respectively, in the subgame perfect equilibrium of the Rubinstein bargaining game for $\delta = 0.5$
We conclude that the subgame perfect equilibrium payoffs of the infinite horizon Rubinstein bargaining game converge to the Nash bargaining solution outcome as the discount factor $\delta$ approaches 1.
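The convergence can be made concrete for the wine division problem. The sketch below is an illustration only (the function name `rubinstein_wine` is an assumption, not notation from the text); it evaluates the closed-form expression for $x_1$ derived from (10.1) and shows that both proposals have the same Nash product.

```python
import math


def rubinstein_wine(delta):
    """Equilibrium proposals x (by player 1) and y (by player 2), in utilities,
    for the wine division problem with frontier x2 = sqrt(1 - x1)."""
    x1 = (1 + delta) / (1 + delta + delta**2)
    x = (x1, math.sqrt(1 - x1))
    y1 = delta * x1                     # second equation in (10.1)
    y = (y1, math.sqrt(1 - y1))
    return x, y


for delta in (0.5, 0.9, 0.99, 0.999):
    x, y = rubinstein_wine(delta)
    # The two Nash products coincide, and x1 approaches 2/3 as delta grows.
    print(delta, round(x[0], 4), round(x[0] * x[1], 6), round(y[0] * y[1], 6))
```

For $\delta = 0.5$ this gives $x_1 = 6/7$ and $y_1 = 3/7$, the two proposals referred to in Fig. 10.4.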

Example 10.4 In the bargaining game $(S', (0, 0))$ of Example 10.3 the Rubinstein proposals are determined by the equations in (10.1) together with $x_2 = 3 - x_1^2$ and $y_2 = 3 - y_1^2$, yielding

$$3 - x_1^2 = x_2 = \delta y_2 = \delta (3 - y_1^2) = \delta (3 - \delta^2 x_1^2),$$

hence

$$x_1 = \sqrt{\frac{3(1 - \delta)}{1 - \delta^3}} = \sqrt{\frac{3}{1 + \delta + \delta^2}}.$$

For $\delta$ approaching 1 we obtain

$$\lim_{\delta \to 1} \sqrt{\frac{3}{1 + \delta + \delta^2}} = 1,$$

which is the Nash bargaining solution payoff to player 1. □
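When no closed form is at hand, the equations in (10.1) can be solved numerically. The following sketch assumes a Pareto frontier given as a decreasing function $x_2 = f(x_1)$ and disagreement point $(0, 0)$, so that (10.1) reduces to $f(x_1) = \delta f(\delta x_1)$; the helper `rubinstein_x1` and its bisection bounds are illustrative choices, not part of the text.

```python
def rubinstein_x1(frontier, delta, lo, hi, tol=1e-12):
    """Solve frontier(x1) = delta * frontier(delta * x1) for x1 by bisection."""
    def gap(x1):
        return frontier(x1) - delta * frontier(delta * x1)

    # Bisection needs opposite signs at the two ends of the interval.
    assert gap(lo) > 0 > gap(hi), "no sign change on [lo, hi]"
    while hi - lo > tol:
        mid = (lo + hi) / 2
        if gap(mid) > 0:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2


frontier = lambda x1: 3 - x1**2   # Pareto frontier of S' in Example 10.3

for delta in (0.5, 0.9, 0.99):
    numeric = rubinstein_x1(frontier, delta, 0.0, 2.0)
    closed_form = (3 / (1 + delta + delta**2)) ** 0.5
    print(delta, numeric, closed_form)   # the two values agree
```

The numerical root matches the closed form of Example 10.4 and moves towards 1 as $\delta$ increases.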


10.2 Exchange Economies

In  an  exchange  economy  with  n  agents  and  k  goods,  each  agent  is  endowed  with 
a  bundle  of  goods.  Each  agent  has  preferences  over  different  bundles  of  goods, 
expressed  by  some  utility  function  over  these  bundles.  By  exchanging  goods  among 
each  other,  it  is  in  general  possible  to  increase  the  utilities  of  all  agents.  One  way 
to  arrange  this  exchange  is  to  introduce  prices.  For  given  prices  the  endowment  of 
each  agent  represents  the  agent’s  income,  which  can  be  spent  on  buying  a  bundle 
of  the  goods  that  maximizes  the  agent’s  utility.  If  prices  are  such  that  the  market 
for  each  good  clears — total  demand  is  equal  to  total  endowment — while  each  agent 
maximizes  utility,  then  the  prices  are  in  equilibrium:  such  an  equilibrium  is  called 
Walrasian  or  competitive  equilibrium.  Alternatively,  reallocations  of  the  goods  can 
be  considered  which  are  in  the  core  of  the  exchange  economy.  A  reallocation  of  the 
total  endowment  is  in  the  core  of  the  exchange  economy  if  no  coalition  of  agents  can 
improve  the  utilities  of  its  members  by,  instead,  reallocating  the  total  endowment  of 
its  own  members  among  each  other.  It  is  well  known  that  a  competitive  equilibrium 
allocation  is  an  example  of  a  core  allocation. 

This  section  is  a  first  acquaintance  with  exchange  economies.  Attention  is 
restricted  to  exchange  economies  with  two  agents  and  two  goods.  We  work  out 
an  example  of  such  an  economy.  Some  variations  are  considered  in  Problem  10.4. 

There are two agents, $A$ and $B$, and two goods, 1 and 2. Agent $A$ has an endowment $e^A = (e^A_1, e^A_2) \in \mathbb{R}^2_+$ of the goods, and a utility function $u^A : \mathbb{R}^2_+ \to \mathbb{R}$, representing the preferences of $A$ over bundles of goods. Similarly, agent $B$ has an endowment $e^B = (e^B_1, e^B_2) \in \mathbb{R}^2_+$ of the goods, and a utility function $u^B : \mathbb{R}^2_+ \to \mathbb{R}$. (We use superscripts to denote the agents and subscripts to denote the goods.) This is a complete description of the exchange economy.

For our example we take $e^A = (2, 3)$, $e^B = (4, 1)$, $u^A(x_1, x_2) = x_1^2 x_2$ and $u^B(x_1, x_2) = x_1 x_2^2$. Hence, the total endowment in the economy is $e = (6, 4)$, and the purpose of the exchange is to reallocate this bundle of goods such that both agents are better off.

Let $p = (p_1, p_2)$ be a vector of positive prices of the goods. Given these prices, both agents want to maximize their utilities. Agent $A$ has an income of $p_1 e^A_1 + p_2 e^A_2$, i.e., the monetary value of his endowment. Then agent $A$ solves the maximization problem

$$\begin{array}{ll} \text{maximize} & u^A(x_1, x_2) \\ \text{subject to} & p_1 x_1 + p_2 x_2 = p_1 e^A_1 + p_2 e^A_2, \quad x_1, x_2 \ge 0. \end{array} \tag{10.2}$$

The income constraint is called the budget equation. The solution of this maximization problem is a bundle $x^A(p) = (x^A_1(p), x^A_2(p))$, called agent $A$'s demand function. Maximization problem (10.2) is called the consumer problem (of agent $A$). Similarly, agent $B$'s consumer problem is:

$$\begin{array}{ll} \text{maximize} & u^B(x_1, x_2) \\ \text{subject to} & p_1 x_1 + p_2 x_2 = p_1 e^B_1 + p_2 e^B_2, \quad x_1, x_2 \ge 0. \end{array} \tag{10.3}$$


For our example, (10.2) becomes

$$\begin{array}{ll} \text{maximize} & x_1^2 x_2 \\ \text{subject to} & p_1 x_1 + p_2 x_2 = 2p_1 + 3p_2, \quad x_1, x_2 \ge 0, \end{array}$$

which can be solved by using Lagrange's method or by substitution. By using the latter method the problem reduces to

$$\text{maximize} \quad x_1^2 \left( \frac{2p_1 + 3p_2 - p_1 x_1}{p_2} \right)$$

subject to $x_1 \ge 0$ and $2p_1 + 3p_2 - p_1 x_1 \ge 0$. Setting the derivative with respect to $x_1$ equal to 0 yields

$$2 x_1 \left( \frac{2p_1 + 3p_2 - p_1 x_1}{p_2} \right) - x_1^2 \left( \frac{p_1}{p_2} \right) = 0,$$

which after some simplifications yields the demand function $x_1 = x^A_1(p) = (4p_1 + 6p_2)/(3p_1)$. By using the budget equation, $x^A_2(p) = (2p_1 + 3p_2)/(3p_2)$. Similarly, solving (10.3) for our example yields $x^B_1(p) = (4p_1 + p_2)/(3p_1)$ and $x^B_2(p) = (8p_1 + 2p_2)/(3p_2)$ (check this!).

4 $\mathbb{R}^2_+ := \{x = (x_1, x_2) \in \mathbb{R}^2 \mid x_1, x_2 \ge 0\}$.

The prices $p$ are Walrasian equilibrium prices if the markets for both goods clear. For the general model, this means that $x^A_1(p) + x^B_1(p) = e^A_1 + e^B_1$ and $x^A_2(p) + x^B_2(p) = e^A_2 + e^B_2$. For the example, this means

$$\frac{4p_1 + 6p_2}{3p_1} + \frac{4p_1 + p_2}{3p_1} = 6 \quad \text{and} \quad \frac{2p_1 + 3p_2}{3p_2} + \frac{8p_1 + 2p_2}{3p_2} = 4.$$

Both equations result in the same condition, namely $10p_1 - 7p_2 = 0$. That there is only one equation left is no coincidence, since prices are only relative, as is easily seen from the budget equations. In fact, the prices represent the rate of exchange between the two goods, and are meaningful even if money does not exist in the economy. Thus, $p = (7, 10)$ (or any positive multiple thereof) are the equilibrium prices in this exchange economy. The associated equilibrium demands are $x^A(7, 10) = (88/21, 22/15)$ and $x^B(7, 10) = (38/21, 38/15)$.
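The equilibrium computation can be verified with exact rational arithmetic. This is a small sketch under the assumptions of the example only: the function names `demand_A` and `demand_B` are illustrative, and they encode the demand functions derived above (each agent spends 2/3 of income on the good that enters his utility squared and 1/3 on the other).

```python
from fractions import Fraction


def demand_A(p1, p2):
    """Demand of agent A: utility x1^2 * x2, endowment (2, 3)."""
    income = 2 * p1 + 3 * p2
    return (Fraction(2, 3) * income / p1, Fraction(1, 3) * income / p2)


def demand_B(p1, p2):
    """Demand of agent B: utility x1 * x2^2, endowment (4, 1)."""
    income = 4 * p1 + 1 * p2
    return (Fraction(1, 3) * income / p1, Fraction(2, 3) * income / p2)


p1, p2 = Fraction(7), Fraction(10)
xA, xB = demand_A(p1, p2), demand_B(p1, p2)

# Excess demand for each good: total demand minus total endowment (6, 4).
excess = (xA[0] + xB[0] - 6, xA[1] + xB[1] - 4)

print(xA, xB, excess)
# Only relative prices matter: doubling both prices leaves demand unchanged.
print(demand_A(2 * p1, 2 * p2) == xA)
```

Both markets clear at $p = (7, 10)$, and the demands are the fractions stated in the text.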

We now turn to the core of an exchange economy. A reallocation of the total endowments is in the core if no coalition can improve upon it. This definition is in the same spirit as the corresponding definition for TU-games (Definition 9.2). In a two-person exchange economy, there are only three coalitions (excluding the empty coalition), namely $\{A\}$, $\{B\}$, and $\{A, B\}$. Consider an allocation $(x^A, x^B)$ with $x^A_1 + x^B_1 = e^A_1 + e^B_1$ and $x^A_2 + x^B_2 = e^A_2 + e^B_2$. To prevent agent $A$ or agent $B$ from improving upon $(x^A, x^B)$ we need that

$$u^A(x^A) \ge u^A(e^A), \quad u^B(x^B) \ge u^B(e^B), \tag{10.4}$$

which are the individual rationality constraints. To prevent the grand coalition $\{A, B\}$ from improving upon $(x^A, x^B)$ we need that

for no $(y^A, y^B)$ with $y^A_1 + y^B_1 = e^A_1 + e^B_1$ and $y^A_2 + y^B_2 = e^A_2 + e^B_2$ we have: $u^A(y^A) \ge u^A(x^A)$ and $u^B(y^B) \ge u^B(x^B)$ with at least one inequality strict. (10.5)

In words, (10.5) says that there should be no other reallocation of the total endowments such that no agent is worse off and at least one agent is strictly better off. This is the efficiency or Pareto optimality constraint.

We apply (10.4) and (10.5) to our example. The individual rationality constraints are

$$(x^A_1)^2 x^A_2 \ge 12, \quad x^B_1 (x^B_2)^2 \ge 4.$$
The Pareto optimal allocations, satisfying (10.5), can be computed as follows. Fix the utility level of one of the agents, say $B$, and maximize the utility of $A$ subject to the utility level of $B$ being fixed. By varying the fixed utility level of $B$ we find all Pareto optimal allocations. In the example, we solve the following maximization problem for $c \in \mathbb{R}$:

$$\begin{array}{ll} \text{maximize} & (x^A_1)^2 x^A_2 \\ \text{subject to} & x^A_1 + x^B_1 = 6, \quad x^A_2 + x^B_2 = 4, \quad x^B_1 (x^B_2)^2 = c, \quad x^A_1, x^A_2, x^B_1, x^B_2 \ge 0. \end{array}$$

By substitution this problem reduces to

$$\begin{array}{ll} \text{maximize} & (x^A_1)^2 x^A_2 \\ \text{subject to} & (6 - x^A_1)(4 - x^A_2)^2 = c, \quad x^A_1, x^A_2 \ge 0. \end{array}$$

The associated Lagrange function is $(x^A_1)^2 x^A_2 - \lambda \left[ (6 - x^A_1)(4 - x^A_2)^2 - c \right]$ and the first-order conditions are

$$2 x^A_1 x^A_2 + \lambda (4 - x^A_2)^2 = 0, \quad (x^A_1)^2 + 2 \lambda (6 - x^A_1)(4 - x^A_2) = 0.$$

Extracting $\lambda$ from both equations and simplifying yields

$$x^A_2 = \frac{4 x^A_1}{24 - 3 x^A_1}.$$

Thus, for any value of $x^A_1$ between 0 and 6 this equation returns the corresponding value of $x^A_2$, resulting in a Pareto optimal allocation with $x^B_1 = 6 - x^A_1$ and $x^B_2 = 4 - x^A_2$.

It  is  straightforward  to  check  by  substitution  that  the  Walrasian  equilibrium 
allocation  xA  (7, 10)  =  (88/21, 22/15)  and  xB (7, 10)  =  (38/21, 38/15)  found 
above,  is  Pareto  optimal.  This  is  no  coincidence:  the  First  Welfare  Theorem  states 
that  in  an  exchange  economy  such  as  the  one  under  consideration,  a  Walrasian 
equilibrium  allocation  is  Pareto  optimal. 

Fig. 10.5 The contract curve is the curve through $c$ and $c'$. The point $c$ is the point of intersection of the contract curve and the indifference curve of agent $A$ through the endowment point $e$. The point $c'$ is the point of intersection of the contract curve and the indifference curve of agent $B$ through the endowment point $e$. The core consists of the allocations on the contract curve between $c$ and $c'$. The straight line ('budget line') through $e$ is the graph of the budget equation for $A$ at the equilibrium prices, i.e., $7x_1 + 10x_2 = 44$, and its point of intersection with the contract curve, $w$, is the Walrasian equilibrium allocation. At this point the indifference curves of the two agents are both tangential to the budget line

Combining the individual rationality constraint for agent $A$ with the Pareto optimality constraint yields $4(x^A_1)^3/(24 - 3x^A_1) \ge 12$, which holds for $x^A_1$ larger than approximately 3.45. For agent $B$, similarly, the individual rationality and Pareto optimality constraints imply

$$(6 - x^A_1) \left( 4 - \frac{4 x^A_1}{24 - 3 x^A_1} \right)^2 \ge 4,$$

which holds for $x^A_1$ smaller than approximately 4.89. Hence, the core of the exchange economy in the example is, approximately, the set

$$\left\{ (x^A_1, x^A_2, x^B_1, x^B_2) \in \mathbb{R}^4 \;\middle|\; 3.45 \le x^A_1 \le 4.89,\ x^A_2 = \frac{4 x^A_1}{24 - 3 x^A_1},\ x^B_1 = 6 - x^A_1,\ x^B_2 = 4 - x^A_2 \right\}.$$

Clearly, the Walrasian equilibrium allocation is in the core, since $3.45 \le 88/21 \le 4.89$, and this also holds more generally. Thus, decentralization of the reallocation process through prices leads to an allocation that is in the core.
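The two end points of the core can be located numerically. The sketch below is only an illustration under the assumptions of the example: the helper names (`contract_curve`, `slack_A`, `slack_B`, `bisect`) are hypothetical, and a simple bisection is used because each individual rationality slack changes sign exactly once along the contract curve for $0 \le x^A_1 \le 6$.

```python
def contract_curve(x1):
    """Agent A's amount of good 2 on the contract curve, given A's amount x1 of good 1."""
    return 4 * x1 / (24 - 3 * x1)


def slack_A(x1):
    """u^A on the contract curve minus u^A(e^A) = 12."""
    return x1**2 * contract_curve(x1) - 12


def slack_B(x1):
    """u^B on the contract curve minus u^B(e^B) = 4."""
    return (6 - x1) * (4 - contract_curve(x1)) ** 2 - 4


def bisect(f, lo, hi, tol=1e-10):
    """Root of f on [lo, hi], assuming f changes sign exactly once there."""
    sign_lo = f(lo) > 0
    while hi - lo > tol:
        mid = (lo + hi) / 2
        if (f(mid) > 0) == sign_lo:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2


lower = bisect(slack_A, 0.0, 6.0)   # agent A is indifferent to his endowment here
upper = bisect(slack_B, 0.0, 6.0)   # agent B is indifferent to his endowment here
walras = 88 / 21

print(lower, upper)                  # roughly 3.45 and 4.89
print(lower <= walras <= upper)      # the Walrasian allocation lies in the core
```

The same functions confirm that the Walrasian allocation is on the contract curve: `contract_curve(88 / 21)` equals 22/15 up to rounding.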

For  an  exchange  economy  with  two  agents  and  two  goods  a  very  useful  pictorial 
device  is  the  Edgeworth  box ,  see  Fig.  10.5.  The  Edgeworth  box  consists  of  all 
possible  reallocations  of  the  two  goods.  The  origin  for  agent  A  is  the  South  West 
corner  and  the  origin  for  agent  B  the  North  East  corner.  In  the  diagram,  the 
indifference  curves  of  the  agents  through  the  endowment  point  are  plotted,  as  well 
as  the  contract  curve ,  i.e.,  the  set  of  Pareto  optimal  allocations.  The  core  is  the 
subset  of  the  contract  curve  between  the  indifference  curves  of  the  agents  through 
the  endowment  point. 


10.3  Matching  Problems 

In a matching problem there is a group of agents that have to form couples. Examples are: students who have to be coupled with schools; hospitals that have to be coupled with doctors; workers who have to be coupled with firms; men who have to be coupled with women; etc. In this section we consider so-called one-to-one matching problems.

Table 10.1 A matching problem

m1    m2    m3    w1    w2    w3
w2    w1    w1    m1    m2    m1
w1    w2    w2    m3    m1    m3
-     w3    -     m2    m3    m2

The agents are divided into two equally large (finite and nonempty) sets, denoted $M$ and $W$. Each agent in $M$ has a strict preference over those agents in $W$ which he prefers over staying single. Similarly, each agent in $W$ has a strict preference over those agents in $M$ which she prefers over staying single. In such a matching problem, a matching assigns to each agent in $M$ at most one agent in $W$, and vice versa; thus, no two agents in $M$ are assigned the same agent in $W$, and vice versa.

Such  matching  problems  are  also  called  marriage  problems ,  and  the  agents  of  M 
and  W  are  called  men  and  women ,  respectively.  While  the  problem  may  indeed  refer 
to  the  ‘marriage  market’,  this  terminology  is  of  course  adopted  for  convenience. 
Other  examples  are  matching  tasks  and  people,  or  matching  rooms  and  people. 

As an example, consider the matching problem in Table 10.1. The set of men is $M = \{m_1, m_2, m_3\}$ and the set of women is $W = \{w_1, w_2, w_3\}$. The columns in the table represent the preferences. For instance, $m_1$ prefers $w_2$ over $w_1$ over staying single, but prefers staying single over $w_3$. An example of a matching in this particular matching problem is $(m_1, w_1)$, $(m_3, w_2)$, $m_2$, $w_3$, meaning that $m_1$ is married to $w_1$ and $m_3$ to $w_2$, while $m_2$ and $w_3$ stay single. Observe that this matching does not seem very 'stable': $m_1$ and $w_2$ would prefer to be married to each other instead of to their partners in the given matching. Moreover, $m_2$ and $w_3$ would prefer to be married to each other instead of being single. Also, for instance, any matching in which $m_1$ would be married to $w_3$ would not be plausible, since $m_1$ would prefer to stay single.

The obvious way to formalize these considerations is to require that a matching should be in the core of the matching problem. A matching is in the core if there is no subgroup (coalition) of men and/or women who can do better by marrying (or staying single) among each other. For a matching to be in the core, the following two requirements are certainly necessary:

(c1) each person prefers his/her partner over being single;

(c2) if $m \in M$ and $w \in W$ are not matched to each other, then it is not the case that both $m$ prefers $w$ over his current partner if $m$ is married or over being single if $m$ is not married; and $w$ prefers $m$ over her current partner if $w$ is married or over being single if $w$ is not married.


5 Check  that  there  are  34  possible  different  matchings  for  this  problem. 
Obviously, if (c1) were violated then the person in question could improve by divorcing and becoming single; if (c2) were violated then $m$ and $w$ would both be better off by marrying each other. A matching satisfying (c1) and (c2) is called stable. Hence, any matching in the core must be stable. Interestingly, the converse is also true: any stable matching is in the core. To see this, suppose there is a matching outside the core and satisfying (c1) and (c2). Then there is a coalition of agents each of whom can improve by marrying or staying single within that coalition. If a member of the coalition improves by becoming single, then (c1) is violated. If two coalition members improve by marrying each other, then (c2) is violated. This contradiction establishes the claim that stable matchings must be in the core. Thus, the core of a matching problem is the set of all stable matchings.
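For a small problem the set of stable matchings can be found by brute force. The sketch below is illustrative only: it encodes the preferences of Table 10.1 as lists of acceptable partners (best first), enumerates all matchings, and keeps those that satisfy (c1) and (c2). The names `MEN`, `WOMEN`, `all_matchings`, `prefers` and `is_stable` are assumptions made for this example.

```python
from itertools import product

# Acceptable partners, most preferred first (Table 10.1).
MEN = {"m1": ["w2", "w1"], "m2": ["w1", "w2", "w3"], "m3": ["w1", "w2"]}
WOMEN = {"w1": ["m1", "m3", "m2"], "w2": ["m2", "m1", "m3"], "w3": ["m1", "m3", "m2"]}


def all_matchings():
    """Every assignment man -> woman or None in which no woman is used twice."""
    options = [None] + list(WOMEN)
    for choice in product(options, repeat=len(MEN)):
        taken = [w for w in choice if w is not None]
        if len(taken) == len(set(taken)):
            yield dict(zip(MEN, choice))


def prefers(ranking, candidate, current):
    """True if candidate is acceptable and ranked above current (None = single)."""
    if candidate not in ranking:
        return False
    return current is None or ranking.index(candidate) < ranking.index(current)


def is_stable(matching):
    husband = {w: m for m, w in matching.items() if w is not None}
    # (c1): nobody is matched to a partner ranked below staying single.
    for m, w in matching.items():
        if w is not None and (w not in MEN[m] or m not in WOMEN[w]):
            return False
    # (c2): no man and woman both prefer each other over their current situation.
    for m in MEN:
        for w in WOMEN:
            if (matching[m] != w and prefers(MEN[m], w, matching[m])
                    and prefers(WOMEN[w], m, husband.get(w))):
                return False
    return True


matchings = list(all_matchings())
stable = [mu for mu in matchings if is_stable(mu)]
print(len(matchings))   # 34
print(stable)           # [{'m1': 'w1', 'm2': 'w2', 'm3': None}]
```

In this example the core turns out to consist of a single matching, in which $m_3$ and $w_3$ stay single.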

How  can  stable  matchings  be  computed?  A  convenient  procedure  is  the  deferred 
acceptance  procedure ,  developed  by  Gale  and  Shapley.  In  this  procedure,  the 
members  of  one  of  the  two  parties  propose  and  the  members  of  the  other  party  accept 
or  reject  proposals.  Suppose  men  propose.  In  the  first  round,  each  man  proposes  to 
his  favorite  woman  (or  stays  single  if  he  prefers  that)  and  each  woman,  if  proposed 
to  at  least  once,  chooses  her  favorite  man  among  those  who  have  proposed  to  her 
(which  may  mean  staying  single).  This  way,  a  number  of  couples  may  form,  and 
the  involved  men  and  women  are  called  ‘engaged’ .  In  the  second  round,  the  rejected 
men  propose  to  their  second-best  woman  (or  stay  single);  then  each  woman  again 
picks her favorite among the men who proposed to her, including possibly the man
to  whom  she  is  currently  engaged.  The  procedure  continues  until  all  proposals  are 
accepted.  Then  all  currently  engaged  couples  marry  and  a  matching  is  established. 

It  is  not  hard  to  verify  that  this  matching  is  stable.  A  man  who  stays  single  was 
rejected  by  all  women  he  preferred  over  staying  single  and  therefore  can  find  no 
woman  who  prefers  him  over  her  husband  or  over  being  single.  A  woman  who  stays 
single  was  never  proposed  to  by  any  man  whom  she  prefers  over  staying  single. 
Consider, finally, an $m \in M$ and a $w \in W$ who are married but not to each other. If
m  prefers  w  over  his  current  wife,  then  w  must  have  rejected  him  for  a  better  partner 
somewhere  in  the  procedure.  If  w  prefers  m  over  her  current  husband,  then  m  has 
never  proposed  to  her  and,  thus,  prefers  his  wife  over  her. 

Of  course,  the  deferred  acceptance  procedure  can  also  be  applied  with  women  as 
proposers,  resulting  in  a  stable  matching  that  is  different  in  general. 

Table 10.2 shows how the deferred acceptance procedure with the men proposing works, applied to the matching problem in Table 10.1.

Table 10.2 The deferred acceptance procedure applied to the matching problem of Table 10.1

      Stage 1           Stage 2     Stage 3     Stage 4
m1    -> w2             rejected    -> w1
m2    -> w1 rejected    -> w2
m3    -> w1                         rejected    -> w2 rejected

The resulting matching is $(m_1, w_1)$, $(m_2, w_2)$, $m_3$, $w_3$

There may be other stable matchings than those found by applying the deferred acceptance procedure with the men and with the women proposing. It can be shown that the former procedure, with the men proposing, results in a stable matching that is optimal, among all stable matchings, from the point of view of the men, whereas the latter procedure, with the women proposing, produces the stable matching optimal from the point of view of the women. See also Problems 10.6 and 10.7.
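The deferred acceptance procedure is easy to put into code. The following is a minimal sketch, not the text's own formulation: it processes one proposal at a time instead of working in rounds (the final matching is known not to depend on this order), and the function name `deferred_acceptance` and the dictionary encoding of Table 10.1 are assumptions made for the example.

```python
def deferred_acceptance(proposers, receivers):
    """Matching produced when `proposers` propose.

    Both arguments map each agent to the list of acceptable partners,
    most preferred first. Returns a dict proposer -> receiver for the couples formed.
    """
    next_choice = {p: 0 for p in proposers}   # position of the next receiver to try
    engaged_to = {}                            # receiver -> proposer currently held
    free = list(proposers)
    while free:
        p = free.pop(0)
        if next_choice[p] >= len(proposers[p]):
            continue                           # list exhausted: p stays single
        r = proposers[p][next_choice[p]]
        next_choice[p] += 1
        ranking = receivers[r]
        if p not in ranking:
            free.append(p)                     # r prefers staying single over p
        elif r not in engaged_to:
            engaged_to[r] = p                  # r was single and accepts for now
        elif ranking.index(p) < ranking.index(engaged_to[r]):
            free.append(engaged_to[r])         # r trades up and releases her partner
            engaged_to[r] = p
        else:
            free.append(p)                     # r keeps her current partner
    return {p: r for r, p in engaged_to.items()}


MEN = {"m1": ["w2", "w1"], "m2": ["w1", "w2", "w3"], "m3": ["w1", "w2"]}
WOMEN = {"w1": ["m1", "m3", "m2"], "w2": ["m2", "m1", "m3"], "w3": ["m1", "m3", "m2"]}

men_propose = deferred_acceptance(MEN, WOMEN)
women_propose = deferred_acceptance(WOMEN, MEN)
print(men_propose)     # couples (m1, w1) and (m2, w2); m3 and w3 stay single
print(women_propose)   # the same couples, listed from the women's side
```

For Table 10.1 both versions of the procedure lead to the same couples, so in this particular problem the men-optimal and women-optimal stable matchings coincide.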


10.4 House Exchange

In  a  house  exchange  problem  each  one  of  finitely  many  agents  owns  a  house,  and 
has  a  preference  over  all  houses.  The  purpose  of  the  exchange  is  to  make  the  agents 
better  off.  A  house  exchange  problem  is  an  exchange  economy  with  as  many  goods 
as  there  are  agents,  and  where  each  agent  is  endowed  with  one  unit  of  a  different, 
indivisible  good. 

Formally, the set of agents is $N = \{1, \ldots, n\}$, and each agent $i \in N$ owns house $h_i$, and has a strict preference over the set of all ($n$) houses. In a core allocation, each agent obtains exactly one house, and there is no coalition that can make each of its members strictly better off by exchanging their initially owned houses among themselves.

As  an  example,  consider  the  house  exchange  problem  in  Table  10.3.  In  this 
problem  there  are  six  possible  different  allocations  of  the  houses.  Table  10.4  lists 
these  allocations  and  also  which  coalitions  could  improve  by  exchanging  their  own 
houses. 

Especially for larger problems, the 'brute force' analysis as in Table 10.4 is rather cumbersome. A different and more convenient way is to use the top trading cycle procedure. In a given house exchange problem a top trading cycle is a sequence $i_1, i_2, \ldots, i_k$ of agents, with $k \ge 1$, such that the favorite house of agent $i_1$ is house $h_{i_2}$, the favorite house of agent $i_2$ is house $h_{i_3}$, ..., and the favorite house of agent $i_k$ is house $h_{i_1}$. If $k = 1$, then this simply means that agent $i_1$ already owns his favorite house. In the top trading cycle procedure, we look for a top trading cycle, assign houses within the cycle, and next the involved agents and their houses leave the scene. Then we repeat the procedure for the remaining agents, etc., until no agent is left.


Table 10.3 A house exchange problem with three agents. For instance, agent 1 prefers the house of agent 3 over the house of agent 2 over his own house

Agent 1    Agent 2    Agent 3
h3         h1         h2
h2         h2         h3
h1         h3         h1

6 Hence,  by  definition  players  in  coalitions  can  only  possibly  improve  by  exchanging  their  initially 
owned  houses,  not  the  houses  they  acquired  after  the  exchange  has  taken  place. 
Table 10.4 Analysis of the house exchange problem of Table 10.3. There are two core allocations

Agent 1    Agent 2    Agent 3    Improving coalition(s)
h1         h2         h3         {1,2}, {1,2,3}
h1         h3         h2         {2}, {1,2}
h2         h1         h3         None: core allocation
h2         h3         h1         {2}, {3}, {2,3}, {1,2,3}
h3         h1         h2         None: core allocation
h3         h2         h1         {3}

In the example in Table 10.3 there is only one top trading cycle, namely 1, 3, 2, resulting in the allocation $1 : h_3$, $3 : h_2$, $2 : h_1$, a core allocation: in fact, each agent obtains his top house. In general, it is true that for strict preferences the top trading cycle procedure results in a core allocation. The reader should check the validity of this claim (Problem 10.8).
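The top trading cycle procedure can be sketched in a few lines. This is an illustrative implementation under simplifying assumptions: houses are identified with their owners (house $h_i$ is written as `i`), preferences are strict and list every house, and the names `top_trading_cycles` and `PREFS` are hypothetical.

```python
def top_trading_cycles(prefs):
    """prefs[i] lists house owners in agent i's order of preference, best first.

    Returns a dict agent -> owner of the house the agent receives.
    """
    remaining = set(prefs)
    allocation = {}
    while remaining:
        # Each remaining agent points at the owner of his favorite remaining house.
        points_to = {i: next(j for j in prefs[i] if j in remaining) for i in remaining}
        # Follow the pointers from some agent until an agent repeats: a cycle closes.
        path, i = [], min(remaining)
        while i not in path:
            path.append(i)
            i = points_to[i]
        cycle = path[path.index(i):]
        for j in cycle:
            allocation[j] = points_to[j]   # agent j receives the house of points_to[j]
        remaining -= set(cycle)            # the cycle's agents and houses leave
    return allocation


# Preferences of Table 10.3: agent 1 ranks h3, h2, h1, and so on.
PREFS = {1: [3, 2, 1], 2: [1, 2, 3], 3: [2, 3, 1]}
print(top_trading_cycles(PREFS))   # agent 1 gets h3, agent 2 gets h1, agent 3 gets h2
```

Starting from agent 1 the pointers run 1, 3, 2 and back to 1, which is the single top trading cycle of the example.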

What about the other core allocation found in Table 10.4? In this allocation, the grand coalition could weakly improve: by the allocation $1 : h_3$, $3 : h_2$, $2 : h_1$ agents 1 and 3 would be strictly better off, while agent 2 would not be worse off. We define the strong core as consisting of those allocations on which no coalition could even weakly improve, that is, make all its members at least as well off and at least one member strictly better off. In the example, only the allocation $1 : h_3$, $3 : h_2$, $2 : h_1$ is in the strong core. In general, one can show that the strong core of a house exchange problem with strict preferences consists of the unique allocation produced by the top trading cycle procedure.
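The distinction between the core and the strong core can be checked by brute force for the three-agent example. The sketch below is an illustration only, with hypothetical names (`PREFS`, `rank`, `blocks`, `core`); it encodes house $h_i$ as `i`, lets each coalition redistribute only its initially owned houses, and compares every such redistribution with the allocation under consideration.

```python
from itertools import combinations, permutations

PREFS = {1: [3, 2, 1], 2: [1, 2, 3], 3: [2, 3, 1]}   # Table 10.3, best house first
AGENTS = sorted(PREFS)


def rank(i, house):
    """Position of a house in agent i's ranking; smaller means more preferred."""
    return PREFS[i].index(house)


def blocks(coalition, allocation, weakly):
    """Can the coalition improve on `allocation` by reshuffling its own houses?"""
    for houses in permutations(coalition):
        gains = [rank(i, allocation[i]) - rank(i, h) for i, h in zip(coalition, houses)]
        if weakly and min(gains) >= 0 and max(gains) > 0:
            return True    # nobody worse off, somebody strictly better off
        if not weakly and min(gains) > 0:
            return True    # everybody strictly better off
    return False


def core(weakly=False):
    """Core (weakly=False) or strong core (weakly=True), as tuples (house of 1, 2, 3)."""
    coalitions = [c for k in range(1, len(AGENTS) + 1) for c in combinations(AGENTS, k)]
    result = []
    for houses in permutations(AGENTS):
        allocation = dict(zip(AGENTS, houses))
        if not any(blocks(c, allocation, weakly) for c in coalitions):
            result.append(houses)
    return result


print(core())              # [(2, 1, 3), (3, 1, 2)]
print(core(weakly=True))   # [(3, 1, 2)]
```

The output reproduces the two core allocations of Table 10.4 and shows that only the top trading cycle allocation survives weak improvement by the grand coalition.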


10.5  Problems 

10.1.  A  Division  Problem  (1) 

Suppose  two  players  (bargainers)  bargain  over  the  division  of  one  unit  of  a  perfectly 

divisible  good.  Player  1  has  utility  function  u\(a)  —  a  and  player  2  has  utility 

function  U2(P)  =  1  —  (1  —  /3)2  for  amounts  a,  e  [0, 1]  of  the  good. 

(a)  Determine  the  set  of  feasible  utility  pairs.  Make  a  picture. 

(b)  Determine  the  Nash  bargaining  solution  outcome,  in  terms  of  utilities  as  well  as 
of  the  distribution  of  the  good. 

(c) Suppose the players’ utilities are discounted by a factor $\delta \in [0, 1)$. Calculate the
Rubinstein bargaining outcome, i.e., the subgame perfect equilibrium outcome
of the infinite horizon alternating offers bargaining game.

(d) Determine the limit of the Rubinstein bargaining outcome, for $\delta$ approaching 1,
in two ways: by using the result of (b) and by using the result of (c).

10.2. A Division Problem (2)

Suppose that two players (bargainers) bargain over the division of one unit of a
perfectly divisible good. Assume that player 1 has utility function $u(\alpha)$ $(0 \le \alpha \le 1)$
and player 2 has utility function $v(\alpha) = 2u(\alpha)$ $(0 \le \alpha \le 1)$.

Determine the distribution of the good according to the Nash bargaining solution.
Can you say something about the resulting utilities? (Hint: use the relevant
properties of the Nash bargaining solution.)
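
A numerical experiment can make the hint concrete. The sketch below maximizes the Nash product on a grid. It is only an illustration under assumptions that are not part of the problem: the disagreement point is taken to be $(0, 0)$, and the concrete utility function $u(\alpha) = \sqrt{\alpha}$ is a hypothetical choice, since the problem leaves $u$ unspecified. The function name `nash_bargaining_split` is likewise made up.

```python
import math


def nash_bargaining_split(u, v, total=1.0, steps=100_000):
    """Grid search for player 1's share alpha maximizing u(alpha) * v(total - alpha).

    Assumes the disagreement point is (0, 0), so the Nash product is simply u * v.
    """
    best_alpha, best_product = 0.0, -math.inf
    for k in range(steps + 1):
        alpha = total * k / steps
        product = u(alpha) * v(total - alpha)
        if product > best_product:
            best_alpha, best_product = alpha, product
    return best_alpha


def u(alpha):
    # Hypothetical utility function, chosen only for the experiment.
    return math.sqrt(alpha)


def v(alpha):
    # Player 2's utility is twice player 1's, as in the problem.
    return 2 * u(alpha)


split_scaled = nash_bargaining_split(u, v)   # player 2 uses v = 2u
split_equal = nash_bargaining_split(u, u)    # player 2 uses u itself
print(split_scaled, split_equal)
```

In this experiment both calls return the same share for player 1, which is in line with what the properties mentioned in the hint suggest; a numerical check for one particular $u$ is of course not a proof.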

10.3.  A  Division  Problem  (3) 

Suppose that two players (bargainers) bargain over the division of two units of a
perfectly divisible good. Assume that player 1 has a utility function $u(\alpha) = \alpha/2$
$(0 \le \alpha \le 2)$ and player 2 has utility function $v(\alpha) = \sqrt{\alpha}$ $(0 \le \alpha \le 2)$.

(a) Determine the distribution of the good according to the Rubinstein bargaining
procedure, for any discount factor $0 < \delta < 1$.

(b) Use the result to determine the Nash bargaining solution distribution.

(c) Suppose player 1’s utility function changes to $w(\alpha) = \alpha$ for $0 \le \alpha \le 1.6$ and
$w(\alpha) = 1.6$ for $1.6 \le \alpha \le 2$. Determine the Nash bargaining solution outcome,
both in utilities and in distribution, for this new situation.

10.4.  An  Exchange  Economy 

Consider an exchange economy with two agents $A$ and $B$ and two goods. The agents
are endowed with initial bundles $e^A = (3, 1)$ and $e^B = (1, 3)$. Their preferences
are represented by the utility functions $u^A(x_1, x_2) = \ln(x_1 + 1) + \ln(x_2 + 2)$ and
$u^B(x_1, x_2) = 3\ln(x_1 + 1) + \ln(x_2 + 1)$.

(a)  Compute  the  demand  functions  of  the  agents. 

(b)  Compute  Walrasian  equilibrium  prices  and  the  equilibrium  allocation. 

(c)  Compute  the  contract  curve  and  the  core.  Sketch  the  Edgeworth  box. 

(d)  Show  that  the  Walrasian  equilibrium  allocation  is  in  the  core. 

(e)  How  would  you  set  up  a  two-person  bargaining  problem  associated  with  this 
economy?  Would  it  make  sense  to  consider  the  Nash  bargaining  solution  in  order 
to  compute  an  allocation?  Why  or  why  not? 

10.5.  The  Matching  Problem  of  Table  10.1  Continued 

(a)  Apply  the  deferred  acceptance  procedure  to  the  matching  problem  of  Table  10.1 
with  the  women  proposing. 

(b)  Are  there  any  other  stable  matchings  in  this  example? 


10.6.  Another  Matching  Problem 

Consider the matching problem with three men, three women, and preferences as in
Table 10.5.

Table 10.5 The matching problem of Problem 10.6

$m_1$   $m_2$   $m_3$   $w_1$   $w_2$   $w_3$
$w_1$   $w_1$   $w_1$   $m_1$   $m_1$   $m_1$
$w_2$   $w_2$   $w_3$   $m_2$   $m_3$   $m_2$
$w_3$   $w_3$   $w_2$   $m_3$   $m_2$   $m_3$

Table 10.6 The matching problem of Problem 10.7

$m_1$   $m_2$   $m_3$   $w_1$   $w_2$   $w_3$
$w_2$   $w_1$   $w_1$   $m_1$   $m_3$   $m_1$
$w_1$   $w_3$   $w_2$   $m_3$   $m_1$   $m_3$
$w_3$   $w_2$   $w_3$   $m_2$   $m_2$   $m_2$


(a)  Compute  the  two  matchings  produced  by  the  deferred  acceptance  procedure 
with  the  men  and  with  the  women  proposing. 

(b)  Are  there  any  other  stable  matchings? 

(c)  Verify  the  claim  made  in  the  text  about  the  optimality  of  the  matchings  in  (a)  for 
the  men  and  the  women,  respectively. 


10.7.  Yet  Another  Matching  Problem:  Strategic  Behavior 

Consider  the  matching  problem  with  three  men,  three  women,  and  preferences  as  in 
Table  10.6. 

(a)  Compute  the  two  matchings  produced  by  the  deferred  acceptance  procedure 
with  the  men  and  with  the  women  proposing. 

(b)  Are  there  any  other  stable  matchings? 

Now consider the following noncooperative game. The players are $w_1$, $w_2$, and $w_3$.
The strategy set of a player is simply the set of all possible preferences over the men.
(Thus, each player has 16 different strategies.) The outcomes of the game are the
matchings produced by the deferred acceptance procedure with the men proposing,
assuming that each man uses his true preference given in Table 10.6.

(c) Show that the following preferences form a Nash equilibrium: $w_2$ and $w_3$
use their true preferences, as given in Table 10.6; $w_1$ uses the preference
$(m_1, m_2, m_3)$. Conclude that sometimes it may pay off to lie about one’s true
preference. (Hint: in a Nash equilibrium, no player can gain by deviating.)
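
The effect of a misreport can be explored with a small implementation of the deferred acceptance procedure. The sketch below is an illustration under assumptions: all preference lists are complete and strict (so it does not cover all 16 strategies of a woman), the function name `deferred_acceptance` is hypothetical, and the lists are a reading of Table 10.6 that should be checked against the table. It only compares truthful reporting with the misreport of $w_1$ from part (c); it does not verify the Nash equilibrium property.

```python
def deferred_acceptance(proposer_prefs, receiver_prefs):
    """Deferred acceptance with complete strict lists; returns proposer -> receiver."""
    # rank[r][p] is the position of proposer p in receiver r's list (0 = best).
    rank = {r: {p: k for k, p in enumerate(lst)} for r, lst in receiver_prefs.items()}
    next_choice = {p: 0 for p in proposer_prefs}   # index of the next receiver to try
    engaged = {}                                    # receiver -> tentatively accepted proposer
    free = list(proposer_prefs)
    while free:
        p = free.pop()
        r = proposer_prefs[p][next_choice[p]]
        next_choice[p] += 1
        current = engaged.get(r)
        if current is None:
            engaged[r] = p                          # r was unmatched and accepts tentatively
        elif rank[r][p] < rank[r][current]:
            engaged[r] = p                          # r trades up and rejects her current partner
            free.append(current)
        else:
            free.append(p)                          # r rejects the new proposal
    return {p: r for r, p in engaged.items()}


# Preference lists as read from Table 10.6 (best to worst).
men = {"m1": ["w2", "w1", "w3"], "m2": ["w1", "w3", "w2"], "m3": ["w1", "w2", "w3"]}
women_true = {"w1": ["m1", "m3", "m2"], "w2": ["m3", "m1", "m2"], "w3": ["m1", "m3", "m2"]}
# w1 reports (m1, m2, m3) instead of her true preference.
women_lie = dict(women_true, w1=["m1", "m2", "m3"])

truthful = deferred_acceptance(men, women_true)
misreport = deferred_acceptance(men, women_lie)
print(truthful)   # w1 is matched to m3
print(misreport)  # w1 is matched to m1
```

With these lists, $w_1$ ends up with her second choice under truthful reporting and with her true first choice after the misreport.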

10.8.  Core  Property  of  Top  Trading  Cycle  Procedure 

Show  that  for  strict  preferences  the  top  trading  cycle  results  in  a  core  allocation. 

10.9.  House  Exchange  with  Identical  Preferences 

Consider the $n$-agent house exchange problem where all agents have identical strict
preferences. Find the house allocation(s) in the core.

Table 10.7 The house exchange problem of Problem 10.10


Player  1 

Player  2 

Player  3 

Player  4 

hi 

/14 

hi 

hi 

h2 

hi 

/l4 

h2 

h2 

hi 

hi 

hi 

h 

h2 

tl4 

10.10.  A  House  Exchange  Problem 

Consider  the  house  exchange  problem  with  four  agents  in  Table  10.7. 

Compute  all  core  allocations  and  all  strong  core  allocations. 

10.11.  Cooperative  Oligopoly 

Consider the Cournot oligopoly game with $n$ firms with different costs $c_1, c_2, \ldots, c_n$.
(This is the game of Problem 6.2 with heterogeneous costs.) As before, each firm $i$
offers $q_i \ge 0$, and the price-demand function is $p = \max\{0, a - \sum_{i=1}^{n} q_i\}$, where
$0 \le c_i < a$ for all $i$.

(a) Show that the reaction function of player $i$ is

$$q_i = \max\left\{0, \frac{a - c_i - \sum_{j \ne i} q_j}{2}\right\}.$$

(b) Show that the unique Nash equilibrium of the game is $q^* = (q_1^*, \ldots, q_n^*)$ with

$$q_i^* = \frac{a - n c_i + \sum_{j \ne i} c_j}{n + 1}$$

for each $i$, assuming that this quantity is positive.

(c) Derive that the corresponding profits are

$$\frac{\left(a - n c_i + \sum_{j \ne i} c_j\right)^2}{(n + 1)^2}$$

for each player $i$.
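
The formulas in (a)-(c) can be checked numerically for a concrete case. The sketch below uses the data $n = 3$, $a = 7$, $c_i = i$ from part (e) and exact rational arithmetic; the function names are hypothetical, and the check assumes that all equilibrium quantities are positive, as in (b). It verifies that each equilibrium quantity is a best reply to the others and that the profit equals the expression in (c).

```python
from fractions import Fraction


def cournot_equilibrium(a, costs):
    """Equilibrium quantities from the formula in (b), assuming they are positive."""
    n, total = len(costs), sum(costs)
    return [Fraction(a - n * c + (total - c), n + 1) for c in costs]


def best_response(a, c_i, others_total):
    """Reaction function from (a): best quantity against the others' total output."""
    return max(Fraction(0), Fraction(a - c_i - others_total) / 2)


a, costs = 7, [1, 2, 3]
q = cournot_equilibrium(a, costs)
price = max(Fraction(0), a - sum(q))
profits = [(price - c) * q_i for c, q_i in zip(costs, q)]

for c, q_i, profit in zip(costs, q, profits):
    assert best_response(a, c, sum(q) - q_i) == q_i   # mutual best replies
    assert profit == q_i ** 2                          # profit formula from (c)

print(q)        # quantities 9/4, 5/4, 1/4
print(profits)  # profits 81/16, 25/16, 1/16
```

Such a check confirms the formulas for one instance only; the general derivation asked for in the problem still has to be done by hand.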

Let the firms now be the players in a cooperative TU-game with player set
$N = \{1, 2, \ldots, n\}$, and consider a coalition $S \subseteq N$. What is the total profit that $S$ can
make on its own? This depends on the assumptions that we make on the behavior of
the players outside $S$. Very pessimistically, one could solve the problem

$$\max_{q_i,\, i \in S} \; \min_{q_j,\, j \notin S} \; \sum_{i \in S} P_i(q_1, \ldots, q_n),$$

which is the profit that $S$ can guarantee independent of the players outside $S$. This
view is very pessimistic because it presumes maximal resistance of the outside
players, even if this means that these outside players hurt themselves. In the present
case it is not hard to see that this results in zero profit for $S$.

Two alternative scenarios are: $S$ plays a Cournot-Nash equilibrium in the
$(n - |S| + 1)$-player oligopoly game against the outside firms as separate firms, or $S$
plays a Cournot-Nash equilibrium in the duopoly game against $N \setminus S$.

In the first case we in fact have an oligopoly game with costs $c_j$ for every player
$j \notin S$ and with cost $c_S := \min\{c_i : i \in S\}$ for ‘player’ (coalition) $S$.

(d) By using the results of (a)-(c) show that coalition $S$ obtains a profit of

$$v_1(S) = \frac{\left[a - (n - |S| + 1)c_S + \sum_{j \notin S} c_j\right]^2}{(n - |S| + 2)^2}$$

in this scenario. Thus, this scenario results in a cooperative TU-game $(N, v_1)$.
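
To get a feeling for the game $(N, v_1)$, the formula in (d) can be evaluated for all coalitions. The sketch below does this for $n = 3$, $a = 7$, $c_i = i$ (the data of part (e)) with exact fractions. The function name `v1` and the dictionary representation of the costs are hypothetical, and the formula is used as stated, i.e., under the assumption that all equilibrium quantities involved are positive.

```python
from fractions import Fraction
from itertools import combinations


def v1(S, a, costs):
    """Worth of coalition S when the outside firms act as separate firms.

    costs maps each player to his cost; S is a non-empty collection of players.
    """
    n = len(costs)
    players_in_reduced_game = n - len(S) + 1
    c_S = min(costs[i] for i in S)                          # the coalition uses its cheapest firm
    outside = sum(c for j, c in costs.items() if j not in S)
    return Fraction(a - players_in_reduced_game * c_S + outside,
                    players_in_reduced_game + 1) ** 2


a, costs = 7, {1: 1, 2: 2, 3: 3}
worths = {S: v1(S, a, costs)
          for size in range(1, len(costs) + 1)
          for S in combinations(sorted(costs), size)}
for S, worth in worths.items():
    print(S, worth)
```

The resulting table of worths is the input needed for solution concepts such as the core, the Shapley value, or the nucleolus.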

(e) Assume $n = 3$, $a = 7$, and $c_i = i$ for $i = 1, 2, 3$. Compute the core, the Shapley
value, and the nucleolus for the TU-game $(N, v_1)$.

(f) Show that in the second scenario, coalition $S$ obtains a profit of

$$v_2(S) = \frac{(a - 2c_S + c_{N \setminus S})^2}{9},$$

resulting in a cooperative game $(N, v_2)$.

(g) Assume $n = 3$, $a = 7$, and $c_i = i$ for $i = 1, 2, 3$. Compute the core, the Shapley
value, and the nucleolus for the TU-game $(N, v_2)$.


10.6 Notes

The  Nash  bargaining  model  and  solution  were  proposed  and  characterized  in 
Nash  (1950).  Critique  on  the  independence  of  irrelevant  alternatives  axiom  was 
formalized  in  Kalai  and  Smorodinsky  (1975),  see  Chap.  21. 

For  a  proof  of  the  First  Welfare  Theorem,  see  for  instance  Jehle  and  Reny  (2001). 
There  one  can  also  find  a  proof  of  the  fact  that  the  Walrasian  equilibrium  results  in 
a  core  allocation. 

The  deferred  acceptance  procedure  for  matching  problems  was  first  proposed 
in  Gale  and  Shapley  (1962).  Our  introduction  to  matching  problems  and  house 
exchange  is  largely  based  on  Osborne  (2004).  Also  Problems  10.5-10.7  are  from 
that  source.  Problem  10.10  is  from  Moulin  (1995). 

Two-person bargaining problems and TU-games (Chap. 9) are both special cases
of the general model of cooperative games without transferable utility, so-called
NTU-games. In an NTU-game, a set of feasible utility vectors $V(T)$ is assigned to
each coalition $T$ of players. For a TU-game $(N, v)$ and a coalition $T$, this set takes
the special form $V(T) = \{x \in \mathbb{R}^T \mid \sum_{i \in T} x_i \le v(T)\}$, i.e., a coalition $T$ can attain
any vector of utilities such that the sum of the utilities for the players in $T$ does not
exceed the worth of the coalition. In a two-player bargaining problem $(S, d)$, one can
set $V(\{1, 2\}) = S$ and $V(\{i\}) = \{\alpha \in \mathbb{R} \mid \alpha \le d_i\}$ for $i = 1, 2$. See also Chap. 21.


Social Choice


Social  choice  theory  studies  the  aggregation  of  individual  preferences  into  a 
common  or  social  preference.  It  overlaps  with  several  social  science  disciplines, 
such  as  political  theory  (e.g.,  voting  for  Parliament,  or  for  a  president)  and  game 
theory  (e.g.,  voters  may  vote  strategically,  or  candidates  may  choose  positions 
strategically). 

In  the  classical  model  of  social  choice,  there  is  a  finite  number  of  agents  who 
have  preferences  over  a  finite  number  of  alternatives.  These  preferences  are  either 
aggregated  into  a  social  preference  according  to  a  so-called  social  welfare  function, 
or  result  in  a  common  alternative  according  to  a  so-called  social  choice  function. 

The main purpose of this chapter is to review two classical results, namely
Arrow’s Theorem and the Gibbard-Satterthwaite Theorem. The first theorem applies
to social welfare functions and says that, if the social preference between any
two alternatives should only depend on the individual preferences between these
alternatives and, thus, not on individual preferences involving other alternatives,
then basically the social welfare function must be dictatorial. The second theorem
applies to social choice functions and says that, basically, the only social choice
functions that are invulnerable to strategic manipulation are the dictatorial ones.
These results are often referred to as ‘impossibility theorems’ since dictatorships
are generally regarded as undesirable.

Section 11.1 is introductory. Section 11.2 discusses Arrow’s Theorem and
Sect. 11.3 the Gibbard-Satterthwaite Theorem.


11.1  Introduction  and  Preliminaries 

11.1.1  An  Example 

Suppose there are three agents (individuals, voters) who have strict preferences over
a set of five alternatives ($a_1, \ldots, a_5$), as given in Table 11.1.

Table 11.1 Borda scores

Agent   $a_1$   $a_2$   $a_3$   $a_4$   $a_5$
1       5       1       3       2       4
2       1       2       3       4       5
3       3       4       5       2       1


In this table the preferences of the players are represented by the Borda scores:
the best alternative of an agent obtains 5 points, the second best 4 points, etc., until
the worst alternative which obtains 1 point. For instance, agent 1 has the preference
$a_1 P_1 a_5 P_1 a_3 P_1 a_4 P_1 a_2$ in the notation to be introduced below. We use the Borda
scores as a convenient way to represent these preferences and, more importantly,
to obtain an example of a social welfare as well as a social choice function.

First, suppose that we want to extract a common social ranking of the alternatives
from the individual preferences. One way to do this is to add the Borda scores
per alternative. In the example this results in 9, 7, 11, 8, 10 for $a_1, a_2, a_3, a_4, a_5$,
respectively, resulting in the social ranking $a_3 P a_5 P a_1 P a_4 P a_2$. If we just want to
single out one alternative, then we could take the one with the maximal Borda score,
in this case alternative $a_3$. In the terminology to be introduced formally below, Borda
scores give rise to a social welfare as well as a social choice function.
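
The computation just described can be reproduced with a few lines of Python. This is a small sketch, with hypothetical function names, in which each agent's strict preference is stored as a list from best to worst; the lists are obtained from the scores in Table 11.1.

```python
# Strict preferences from Table 11.1, listed from best to worst.
profile = {
    1: ["a1", "a5", "a3", "a4", "a2"],
    2: ["a5", "a4", "a3", "a2", "a1"],
    3: ["a3", "a2", "a1", "a4", "a5"],
}


def borda_totals(profile):
    """Sum of Borda scores: m points for an agent's best alternative, ..., 1 for the worst."""
    totals = {}
    for ranking in profile.values():
        m = len(ranking)
        for position, alternative in enumerate(ranking):
            totals[alternative] = totals.get(alternative, 0) + (m - position)
    return totals


def social_ranking(totals):
    """Indifference classes ordered from the highest to the lowest total score."""
    levels = sorted(set(totals.values()), reverse=True)
    return [sorted(x for x in totals if totals[x] == score) for score in levels]


totals = borda_totals(profile)
ranking = social_ranking(totals)
winner = max(totals, key=totals.get)

print(totals)   # a1: 9, a2: 7, a3: 11, a4: 8, a5: 10
print(ranking)  # [['a3'], ['a5'], ['a1'], ['a4'], ['a2']]
print(winner)   # a3
```

The function `social_ranking` plays the role of a social welfare function here (it may return indifference classes with more than one element), while picking the alternative with the maximal total corresponds to a social choice function, provided the maximum is unique.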

One potential drawback of using Borda scores to obtain a social ranking is
that the ranking between two alternatives may not just depend on the individual
preferences between these two alternatives. For instance, suppose that agent 1’s
preference would change to $a_1 P_1 a_4 P_1 a_5 P_1 a_3 P_1 a_2$. Then the Borda scores would
change to 9, 7, 10, 10, 9 for $a_1, a_2, a_3, a_4, a_5$, respectively, resulting in the social
ranking $a_3 I a_4 P a_1 I a_5 P a_2$ (where $I$ denotes indifference). Observe that no agent’s
preference between $a_1$ and $a_4$ has changed, but that socially this preference is
reversed. This is not a peculiarity of using Borda scores: Arrow’s Theorem, to be
discussed in Sect. 11.2, states that under some reasonable additional assumptions
the only way to avoid this kind of preference reversal is to make one agent the
dictator, i.e., to have the social preference coincide with the preference of one fixed
agent.
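
The preference reversal can be verified mechanically. The following sketch (function names are hypothetical) computes the Borda totals before and after agent 1's change and checks that every agent ranks $a_1$ and $a_4$ in the same way in both profiles.

```python
def borda_totals(rankings):
    """Total Borda score per alternative; each ranking lists alternatives best to worst."""
    m = len(rankings[0])
    return {x: sum(m - r.index(x) for r in rankings) for x in rankings[0]}


def same_pairwise(x, y, p, q):
    """True if every agent ranks x versus y in the same way in profiles p and q."""
    return all((r.index(x) < r.index(y)) == (s.index(x) < s.index(y))
               for r, s in zip(p, q))


before = [
    ["a1", "a5", "a3", "a4", "a2"],   # agent 1, as in Table 11.1
    ["a5", "a4", "a3", "a2", "a1"],   # agent 2
    ["a3", "a2", "a1", "a4", "a5"],   # agent 3
]
after = [["a1", "a4", "a5", "a3", "a2"]] + before[1:]   # only agent 1 changes

t_before, t_after = borda_totals(before), borda_totals(after)
print(same_pairwise("a1", "a4", before, after))   # True: no agent changed a1 versus a4
print(t_before["a1"], t_before["a4"])             # 9 8: socially a1 above a4
print(t_after["a1"], t_after["a4"])               # 9 10: socially a4 above a1
```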

Another potential drawback of using the Borda scores in order to single out a
unique alternative is that this method is vulnerable to strategic manipulation. For
instance, suppose that agent 1 would lie about his true preference given in Table 11.1
and claim that his preference is $a_1 P_1 a_5 P_1 a_2 P_1 a_4 P_1 a_3$ instead. Then the Borda scores
would change to 9, 9, 9, 8, 10 for $a_1, a_2, a_3, a_4, a_5$, respectively, resulting in the
chosen alternative $a_5$ instead of $a_3$. Since agent 1 prefers $a_5$ over $a_3$ according to his
true preference, he gains by this strategic manipulation. Again, this phenomenon
is not a peculiarity of the Borda method: the Gibbard-Satterthwaite Theorem in
Sect. 11.3 shows that again under some reasonable additional assumptions the only
way to avoid it is to make one fixed agent a dictator.
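
The manipulation can be replayed in code as well. The sketch below (with a hypothetical function name `borda_winner`) compares the chosen alternative under agent 1's true and reported preference and checks that agent 1 truly prefers the new outcome. Ties for the maximal score do not occur in these two profiles and are therefore not handled.

```python
def borda_winner(rankings):
    """Alternative with the maximal total Borda score (no tie-breaking rule is specified)."""
    m = len(rankings[0])
    totals = {x: sum(m - r.index(x) for r in rankings) for x in rankings[0]}
    return max(totals, key=totals.get)


true_pref_1 = ["a1", "a5", "a3", "a4", "a2"]       # agent 1's true preference (Table 11.1)
reported_pref_1 = ["a1", "a5", "a2", "a4", "a3"]   # agent 1's misreport
others = [
    ["a5", "a4", "a3", "a2", "a1"],                # agent 2
    ["a3", "a2", "a1", "a4", "a5"],                # agent 3
]

honest_outcome = borda_winner([true_pref_1] + others)
manipulated_outcome = borda_winner([reported_pref_1] + others)

# Agent 1 gains if he truly ranks the manipulated outcome above the honest one.
gains = true_pref_1.index(manipulated_outcome) < true_pref_1.index(honest_outcome)
print(honest_outcome, manipulated_outcome, gains)   # a3 a5 True
```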


11.1.2  Preliminaries 

Let $A = \{a_1, \ldots, a_m\}$ be the set of alternatives. To keep things interesting we assume
$m \ge 3$. The set of agents is denoted by $N = \{1, \ldots, n\}$. We assume $n \ge 2$.

A binary relation on $A$ is a subset of $A \times A$. In our context, for a binary relation
$R$ on $A$ we usually write $aRb$ instead of $(a, b) \in R$ and interpret this as an agent or
society (weakly) preferring $a$ over $b$. Well-known conditions for a binary relation $R$
on $A$ are:

(a) Reflexivity: $aRa$ for all $a \in A$.

(b) Completeness: $aRb$ or $bRa$ for all $a, b \in A$ with $a \ne b$.

(c) Antisymmetry: For all $a, b \in A$, if $aRb$ and $bRa$, then $a = b$.

(d) Transitivity: For all $a, b, c \in A$, $aRb$ and $bRc$ imply $aRc$.

A preference on $A$ is a reflexive, complete and transitive binary relation on $A$.
For a preference $R$ on $A$ we write $aPb$ if $aRb$ and not $bRa$; and $aIb$ if $aRb$
and $bRa$. The binary relations $P$ and $I$ are called the asymmetric and symmetric
parts of $R$, respectively, and are interpreted as strict preference and indifference.
Check (Problem 11.1) that $P$ is antisymmetric and transitive but not reflexive and
not necessarily complete, and that $I$ is reflexive and transitive but not necessarily
antisymmetric and not necessarily complete. By $\mathcal{L}^*$ we denote the set of all
preferences on $A$, and by $\mathcal{L} \subset \mathcal{L}^*$ the set of all antisymmetric (i.e., strict)
preferences on $A$. In plain words, elements of $\mathcal{L}^*$ order the elements of $A$ but allow
for indifferences, while elements of $\mathcal{L}$ order the elements of $A$ strictly.
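
For a finite set $A$ these conditions can be checked directly. The sketch below represents a binary relation as a Python set of ordered pairs and tests the four conditions on a small example; the function names and the example relation (with $x$ and $y$ indifferent and both preferred to $z$) are hypothetical and serve only as an illustration.

```python
from itertools import product


def is_reflexive(R, A):
    return all((a, a) in R for a in A)


def is_complete(R, A):
    return all((a, b) in R or (b, a) in R for a, b in product(A, repeat=2) if a != b)


def is_antisymmetric(R, A):
    return all(a == b for a, b in product(A, repeat=2) if (a, b) in R and (b, a) in R)


def is_transitive(R, A):
    return all((a, c) in R
               for a, b, c in product(A, repeat=3)
               if (a, b) in R and (b, c) in R)


def asymmetric_part(R):
    """Strict preference P: aPb if aRb and not bRa."""
    return {(a, b) for (a, b) in R if (b, a) not in R}


def symmetric_part(R):
    """Indifference I: aIb if aRb and bRa."""
    return {(a, b) for (a, b) in R if (b, a) in R}


A = {"x", "y", "z"}
# Hypothetical preference: x and y are indifferent, and both are preferred to z.
R = {("x", "x"), ("y", "y"), ("z", "z"), ("x", "y"), ("y", "x"), ("x", "z"), ("y", "z")}
P, I = asymmetric_part(R), symmetric_part(R)

print(is_reflexive(R, A), is_complete(R, A), is_transitive(R, A))   # True True True
print(is_antisymmetric(R, A))                                       # False: R is not strict
print(is_reflexive(P, A), is_complete(P, A))                        # False False
print(is_antisymmetric(I, A), is_complete(I, A))                    # False False
```

In this example $R$ is a preference that is not strict, and its parts $P$ and $I$ behave as claimed in the text.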

In what follows, it is assumed that agents have strict preferences while
social preferences may have indifferences. A strict preference profile is a list
$(R_1, \ldots, R_i, \ldots, R_n)$, where $R_i$ is the strict preference of agent $i$. Hence, $\mathcal{L}^N$ denotes
the set of all strict preference profiles. A social choice function is a map $f : \mathcal{L}^N \to A$,
i.e., it assigns a unique alternative to every profile of strict preferences. A social
welfare function is a map $F : \mathcal{L}^N \to \mathcal{L}^*$, i.e., it assigns a (possibly non-strict)
preference to every profile of strict preferences.


2 See Problem 11.7 for the case $m = 2$.

3 Elements of $\mathcal{L}$ are usually called linear orders and those of $\mathcal{L}^*$ weak orders.


11.2 Arrow’s Theorem

In this section the focus is on social welfare functions. We formulate three properties
for a social welfare function $F : \mathcal{L}^N \to \mathcal{L}^*$. We call $F$:

(a) Pareto Efficient (PE) if for each profile $(R_1, \ldots, R_n) \in \mathcal{L}^N$ and all $a, b \in A$,
if $a \ne b$ and $a R_i b$ for all $i \in N$, then $aPb$, where $P$ is the asymmetric part of
$R = F(R_1, \ldots, R_n)$.

(b) Independent of Irrelevant Alternatives (IIA) if for all $(R_1, \ldots, R_n) \in \mathcal{L}^N$ and
$(R'_1, \ldots, R'_n) \in \mathcal{L}^N$ and all $a, b \in A$, if $a R_i b \Leftrightarrow a R'_i b$ for all $i \in N$, then
$aRb \Leftrightarrow aR'b$, where $R = F(R_1, \ldots, R_n)$ and $R' = F(R'_1, \ldots, R'_n)$.

(c) Dictatorial (D) if there is an $i \in N$ such that $F(R_1, \ldots, R_n) = R_i$ for all
$(R_1, \ldots, R_n) \in \mathcal{L}^N$.

Pareto Efficiency requires that, if all agents prefer an alternative $a$ over an alternative
$b$, then the social ranking should also put $a$ above $b$. Independence of Irrelevant
Alternatives says that the social preference between two alternatives should only
depend on the agents’ preferences between these two alternatives and not on the
position of any other alternative.4 Dictatoriality says that the social ranking is always
equal to the preference of a fixed agent, the dictator. Clearly, there are exactly $n$
dictatorial social welfare functions.

The first two conditions are usually regarded as desirable but the third clearly
not. Unfortunately, Arrow’s Theorem states that the first two conditions imply the
third one.5
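
For very small instances the three properties can be checked by brute force. The sketch below does this for $m = 3$ alternatives and $n = 2$ agents, comparing the Borda rule with a dictatorship of the first agent. It is an illustration under stated assumptions, not a proof: all names are hypothetical, and a social preference is represented by a score per alternative (higher is better, equal scores mean indifference), which suffices for these two rules.

```python
from itertools import permutations, product

A = ("a", "b", "c")
N = 2
ORDERS = list(permutations(A))               # all strict preferences on A, best to worst
PROFILES = list(product(ORDERS, repeat=N))   # all strict preference profiles


def prefers(order, x, y):
    return order.index(x) < order.index(y)


def borda(profile):
    """Social weak order given by total Borda scores."""
    return {x: sum(len(A) - order.index(x) for order in profile) for x in A}


def dictator(profile, i=0):
    """Social order that copies the preference of agent i (the first agent by default)."""
    return {x: len(A) - profile[i].index(x) for x in A}


def pareto_efficient(F):
    return all(F(p)[x] > F(p)[y]
               for p in PROFILES for x, y in permutations(A, 2)
               if all(prefers(o, x, y) for o in p))


def iia(F):
    for p, q in product(PROFILES, repeat=2):
        for x, y in permutations(A, 2):
            same = all(prefers(o, x, y) == prefers(o2, x, y) for o, o2 in zip(p, q))
            if same and (F(p)[x] >= F(p)[y]) != (F(q)[x] >= F(q)[y]):
                return False
    return True


def dictatorial(F):
    return any(all((F(p)[x] > F(p)[y]) == prefers(p[i], x, y)
                   for p in PROFILES for x, y in permutations(A, 2))
               for i in range(N))


for name, F in (("Borda", borda), ("dictator", dictator)):
    print(name, pareto_efficient(F), iia(F), dictatorial(F))
# Borda True False False
# dictator True True True
```

On this small domain the Borda rule is Pareto efficient but violates IIA, whereas the dictatorship satisfies all three properties, which is consistent with the statement of Arrow's Theorem.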

Theorem  11.1  (Arrow’s  Theorem)  Let  F  be  a  Pareto  Efficient  and  IIA  social 
welfare  function.  Then  F  is  dictatorial. 

Proof 

Step 1 Consider a profile in $\mathcal{L}^N$ and two distinct alternatives $a, b \in A$ such that
every agent ranks $a$ on top and $b$ at bottom. By Pareto Efficiency, the social ranking
assigned by $F$ must also rank $a$ on top and $b$ at bottom.

Now change agent 1’s ranking by raising $b$ in it one position at a time. By IIA,
$a$ is ranked socially (by $F$) on top as long as $b$ is still below $a$ in the preference of
agent 1. In the end, if agent 1 ranks $b$ first and $a$ second, we have $a$ or $b$ on top of the
social ranking by Pareto efficiency of $F$. If $a$ is still on top in the social ranking, then
continue the same process with agents 2, 3, etc., until we reach some agent $k$ such
that $b$ is on top of the social ranking after moving $b$ above $a$ in agent $k$’s preference.


4 Although there is some similarity in spirit, this condition is not in any formal sense related to the
IIA condition in bargaining, see Sect. 10.1 or Chap. 21.

5 For this reason the theorem is often referred to as Arrow’s Impossibility Theorem.

Table 11.2 Step 1 of the proof of Theorem 11.1, agent $k$ ranks $a$ above $b$

Table 11.3 Step 1 of the proof of Theorem 11.1, agent $k$ ranks $b$ above $a$

Table 11.4 Step 2 of the proof of Theorem 11.1, arising from Table 11.2

Table 11.5 Step 2 of the proof of Theorem 11.1, arising from Table 11.3

Tables 11.2 and 11.3 give the situations just before and just after $b$ is placed above
$a$ in agent $k$’s preference.

Step 2 Now consider Tables 11.4 and 11.5.

The profile in Table 11.4 arises from the one in Table 11.2 by moving $a$ to the last
position for agents $i < k$ and to the second last position for agents $i > k$. In exactly
the same way, the profile in Table 11.5 arises from the one in Table 11.3. Then IIA
applied to Tables 11.3 and 11.5 implies that $b$ is socially top-ranked in Table 11.5.
Next, IIA applied to the transition from Table 11.5 to Table 11.4 implies that in
Table 11.4 $b$ must still be socially ranked above every alternative except perhaps
$a$. But IIA applied to the transition from Table 11.2 to Table 11.4 implies that in
Table 11.4 $a$ must still be socially ranked above $b$ and hence, by transitivity, above
every alternative. This proves that the social rankings in Tables 11.4 and 11.5 are
correct.

6 In these tables and also the ones below, we generically denote all preferences by $R_1, \ldots, R_n$. The
last column in every table will be used in Sect. 11.3.

Table 11.6 Step 3 of the proof of Theorem 11.1

Table 11.7 Steps 4 and 5 of the proof of Theorem 11.1

Step 3 Consider a third alternative $c$ distinct from $a$ and $b$. The social ranking in
Table 11.6 is obtained from Table 11.4 by applying IIA.


Step 4 Consider the profile in Table 11.7, obtained from the profile in Table 11.6 by
switching $a$ and $b$ for agents $i > k$. By IIA applied to the transition from Table 11.6
to Table 11.7, we have that $a$ must still be socially ranked above every alternative
except possibly $b$. However, $b$ must be ranked below $c$ by Pareto efficiency, which
shows that the social ranking in Table 11.7 is correct.

Step 5 Consider any arbitrary profile in which agent $k$ prefers $a$ to $b$. Change the
profile by moving $c$ between $a$ and $b$ for agent $k$ and to the top of every other agent’s
preference (if this is not already the case). By IIA this does not affect the social
ranking of $a$ vs. $b$. Since the preference of every agent concerning $a$ and $c$ is now as
in Table 11.7, IIA implies that $a$ is socially ranked above $c$, which itself is socially
ranked above $b$ by Pareto Efficiency. Hence, by transitivity of the social ranking we
may conclude that $a$ is socially ranked above $b$ whenever it is preferred by agent $k$
over $b$. By repeating the argument with the roles of $b$ and $c$ reversed, and recalling
that $c$ was an arbitrary alternative distinct from $a$ and $b$, we may conclude that the
social ranking of $a$ is above some alternative whenever agent $k$ prefers $a$ to that
alternative: $k$ is a ‘dictator’ for $a$. Since $a$ was arbitrary, we can repeat the whole
argument to conclude that there must be a dictator for every alternative. Since there
cannot be distinct dictators for distinct alternatives, there must be a single dictator
for all alternatives. ■


11.3 The Gibbard-Satterthwaite Theorem

The Gibbard-Satterthwaite Theorem applies to social choice functions. We start by
listing the following possible properties of a social choice function $f : \mathcal{L}^N \to A$.
Call $f$:

(a) Unanimous (UN) if for each profile $(R_1, \ldots, R_n) \in \mathcal{L}^N$ and each $a \in A$, if
$a R_i b$ for all $i \in N$ and all $b \in A \setminus \{a\}$, then $f(R_1, \ldots, R_n) = a$.

(b) Monotonic (MON) if for all profiles $(R_1, \ldots, R_n) \in \mathcal{L}^N$ and $(R'_1, \ldots, R'_n) \in \mathcal{L}^N$
and all $a \in A$, if $f(R_1, \ldots, R_n) = a$ and $a R_i b \Rightarrow a R'_i b$ for all $b \in A \setminus \{a\}$ and
$i \in N$, then $f(R'_1, \ldots, R'_n) = a$.

(c) Dictatorial (D) if there is an $i \in N$ such that $f(R_1, \ldots, R_n) = a$ where $a R_i b$ for
all $b \in A \setminus \{a\}$, for all $(R_1, \ldots, R_n) \in \mathcal{L}^N$.

(d) Strategy-Proof (SP) if for all profiles $(R_1, \ldots, R_n) \in \mathcal{L}^N$ and $(R'_1, \ldots, R'_n) \in \mathcal{L}^N$
and all $i \in N$, if $R'_j = R_j$ for all $j \in N \setminus \{i\}$, then $f(R_1, \ldots, R_n) \, R_i \, f(R'_1, \ldots, R'_n)$.

Unanimity requires that, if all agents have the same top alternative, then this
alternative should be chosen. Monotonicity says that, if some alternative $a$ is chosen
and the profile changes in such a way that $a$ is still preferred by every agent over
all alternatives over which it was originally preferred, then $a$ should remain
chosen.7 Dictatoriality means that there is a fixed agent whose top element is always
chosen. Strategy-Proofness says that no agent can obtain a better chosen alternative
by lying about his true preference.

In accordance with mathematical parlance, call a social choice function
$f : \mathcal{L}^N \to A$ surjective if for every $a \in A$ there is some profile $(R_1, \ldots, R_n) \in \mathcal{L}^N$
such that $f(R_1, \ldots, R_n) = a$. Hence, each $a$ is chosen at least once.8 The
Gibbard-Satterthwaite Theorem is as follows.

Theorem 11.2 (Gibbard-Satterthwaite Theorem) Let $f : \mathcal{L}^N \to A$ be a surjective
and strategy-proof social choice function. Then $f$ is dictatorial.

Since  surjectivity  is  implied  by  unanimity,  Theorem  11.2  also  holds  with 
unanimity  instead  of  surjectivity. 
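
The properties of social choice functions can again be explored by brute force on a small domain. The sketch below takes $m = 3$ alternatives and $n = 2$ agents and compares two rules: the alternative with the highest Borda total, with ties broken alphabetically (this tie-breaking rule is an assumption made here to obtain a unique choice), and the top alternative of the first agent. All function names are hypothetical.

```python
from itertools import permutations, product

A = ("a", "b", "c")
N = 2
ORDERS = list(permutations(A))               # strict preferences, best to worst
PROFILES = list(product(ORDERS, repeat=N))   # all strict preference profiles


def prefers(order, x, y):
    return order.index(x) < order.index(y)


def borda_choice(profile):
    totals = {x: sum(len(A) - o.index(x) for o in profile) for x in A}
    return max(A, key=lambda x: totals[x])   # ties go to the alphabetically first


def dictator_choice(profile, i=0):
    return profile[i][0]


def unanimous(f):
    return all(f(p) == p[0][0] for p in PROFILES if all(o[0] == p[0][0] for o in p))


def monotonic(f):
    for p, q in product(PROFILES, repeat=2):
        a = f(p)
        # a keeps its position relative to every alternative it was preferred to.
        if all(prefers(o2, a, b) for o, o2 in zip(p, q)
               for b in A if b != a and prefers(o, a, b)) and f(q) != a:
            return False
    return True


def strategy_proof(f):
    for p in PROFILES:
        for i, lie in product(range(N), ORDERS):
            q = p[:i] + (lie,) + p[i + 1:]
            if prefers(p[i], f(q), f(p)):    # agent i strictly gains by misreporting
                return False
    return True


def dictatorial(f):
    return any(all(f(p) == p[i][0] for p in PROFILES) for i in range(N))


def surjective(f):
    return {f(p) for p in PROFILES} == set(A)


checks = (unanimous, surjective, monotonic, strategy_proof, dictatorial)
for name, f in (("Borda", borda_choice), ("dictator", dictator_choice)):
    print(name, [check(f) for check in checks])
# Borda [True, True, False, False, False]
# dictator [True, True, True, True, True]
```

On this domain the Borda choice rule is unanimous and surjective but neither strategy-proof nor dictatorial, in line with Theorem 11.2, while the dictatorial rule has all the listed properties.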

We will prove the Gibbard-Satterthwaite Theorem by using the next theorem,
which is a variant of the Muller-Satterthwaite Theorem.

Theorem 11.3 (Muller-Satterthwaite) Let $f : \mathcal{L}^N \to A$ be a unanimous and
monotonic social choice function. Then $f$ is dictatorial.

7 This property is also called Maskin Monotonicity, after Maskin (1999).

8 In the social choice literature this property is sometimes called citizen-sovereignty.

Proof of Theorem 11.2 We prove that $f$ is unanimous and monotonic. The result
then follows from Theorem 11.3.

Suppose that $f(R_1, \ldots, R_n) = a$ for some profile $(R_1, \ldots, R_n) \in \mathcal{L}^N$ and some
alternative $a \in A$. Let $i \in N$ and let $(R'_1, \ldots, R'_n) \in \mathcal{L}^N$ be a profile such that for all
$j \in N \setminus \{i\}$ we have $R'_j = R_j$ and for all $b \in A \setminus \{a\}$ we have $a R'_i b$ if $a R_i b$. We wish to
show that $f(R'_1, \ldots, R'_n) = a$. Suppose, to the contrary, that $f(R'_1, \ldots, R'_n) = b \ne a$.
Then SP implies $a R_i b$, and hence $a R'_i b$. Again by SP, however, $b R'_i a$, hence, by
antisymmetry of $R'_i$, $a = b$, a contradiction. This proves $f(R'_1, \ldots, R'_n) = a$.

Now suppose that $(R'_1, \ldots, R'_n) \in \mathcal{L}^N$ is a profile such that for all $i \in N$ and
all $b \in A \setminus \{a\}$ we have $a R'_i b$ if $a R_i b$. By applying the argument in the preceding
paragraph $n$ times, it follows that $f(R'_1, \ldots, R'_n) = a$. Hence, $f$ is monotonic.

To prove unanimity, suppose that $(R_1, \ldots, R_n) \in \mathcal{L}^N$ and $a \in A$ such that
$a R_i b$ for all $i \in N$ and $b \in A \setminus \{a\}$. By surjectivity there is $(R'_1, \ldots, R'_n) \in \mathcal{L}^N$
with $f(R'_1, \ldots, R'_n) = a$. By monotonicity we may move $a$ to the top of each
agent’s preference and still have $a$ chosen. Next, again by monotonicity, we may
change each agent $i$’s preference to $R_i$ without changing the chosen alternative, i.e.,
$f(R_1, \ldots, R_n) = a$. Hence, $f$ is unanimous. ■

Proof  of  Theorem  11.3  The  proof  parallels  the  proof  of  Theorem  11.1  and  uses 
analogous  steps  and  the  same  tables. 


Step 1 Consider a profile in $\mathcal{L}^N$ and two distinct alternatives $a, b \in A$ such that
every agent ranks $a$ on top and $b$ at bottom. By unanimity, $f$ chooses $a$.

Now change agent 1’s ranking by raising $b$ in it one position at a time. By MON, $a$
is chosen by $f$ as long as $b$ is still below $a$ in the preference of agent 1. In the end, if
agent 1 ranks $b$ first and $a$ second, we have $a$ or $b$ chosen by $f$, again by MON. If $a$
is still chosen, then continue the same process with agents 2, 3, etc., until we reach
some agent $k$ such that $b$ is chosen after moving $b$ above $a$ in agent $k$’s preference.
Tables 11.2 and 11.3 give the situations just before and just after $b$ is placed above
$a$ in agent $k$’s preference.

Step 2 Now consider Tables 11.4 and 11.5. The profile in Table 11.4 arises from the
one in Table 11.2 by moving $a$ to the last position for agents $i < k$ and to the second
last position for agents $i > k$. In exactly the same way, the profile in Table 11.5
arises from the one in Table 11.3.

Then MON applied to Tables 11.3 and 11.5 implies that $b$ is chosen in Table 11.5.
Next, MON applied to the transition from Table 11.5 to Table 11.4 implies that in
Table 11.4 the choice must be either $b$ or $a$. Suppose $b$ would be chosen. Then MON
applied to the transition from Table 11.4 to Table 11.2 implies that in Table 11.2
$b$ must be chosen as well, a contradiction. Hence, $a$ is chosen in Table 11.4. This
proves that the choices by $f$ in Tables 11.4 and 11.5 are correct.

Step 3 Consider a third alternative $c$ distinct from $a$ and $b$. The choice in Table 11.6
is obtained from Table 11.4 by applying MON.

Step 4 Consider the profile in Table 11.7, obtained from the profile in Table 11.6 by
switching $a$ and $b$ for agents $i > k$. If the choice in Table 11.7 were some $d$ unequal
to $a$ or $b$, then by MON it would also be $d$ in Table 11.6, a contradiction. If it were
$b$, then by MON it would remain $b$ even if $c$ were moved to the top of every
agent’s preference, contradicting unanimity. Hence, it must be $a$.

Step 5 Consider any arbitrary profile with $a$ at the top of agent $k$’s preference. Such
a profile can always be obtained from the profile in Table 11.7 without worsening
the position of $a$ with respect to any other alternative in any agent’s preference. By
MON therefore, $a$ must be chosen whenever it is at the top of agent $k$’s preference,
so $k$ is a ‘dictator’ for $a$. Since $a$ was arbitrary, we can find a dictator for every other
alternative but, clearly, these must be one and the same agent. Hence, this agent is
the dictator. ■

There is a large literature that tries to escape the negative conclusions of
Theorems 11.1-11.3 by adapting the model and/or restricting the domain. Examples
are provided in Problems 6.23 and 6.24.


11.4  Problems 


11.1.  Preferences 

Let  R  be  a  preference  on  A,  with  symmetric  part  I  and  asymmetric  part  P. 

(a)  Prove  that  P  is  antisymmetric  and  transitive  but  not  reflexive  and  not  necessarily 
complete. 

(b)  Prove  that  I  is  reflexive  and  transitive  but  not  necessarily  complete  and  not 
necessarily  antisymmetric. 


11.2.  Pairwise  Comparison 

For a profile $r = (R_1, \ldots, R_n) \in \mathcal{L}^N$ and $a, b \in A$ define

$$N(a, b, r) = \{i \in N \mid a R_i b\},$$

i.e., $N(a, b, r)$ is the set of agents who (strictly) prefer $a$ to $b$ in the profile $r$. With $r$
we can associate a binary relation $C(r)$ on $A$ by defining $a\,C(r)\,b :\Leftrightarrow |N(a, b, r)| >
|N(b, a, r)|$ for all $a, b \in A$. If $a\,C(r)\,b$ we say that ‘$a$ beats $b$ by pairwise majority’.


(a)  Is  C(r)  reflexive?  Complete?  Antisymmetric? 

(b) Show that $C(r)$ is not transitive, by considering the famous Condorcet profile
for $N = \{1, 2, 3\}$ and $A = \{a, b, c\}$: $a R_1 b R_1 c$, $b R_2 c R_2 a$, $c R_3 a R_3 b$.

(c) Call $a$ a Condorcet winner if $|N(a, b, r)| > |N(b, a, r)|$ for all $b \in A \setminus \{a\}$. Is
there a Condorcet winner in the example in Sect. 11.1?

11.3. Independence of the Conditions in Theorem 11.1

Show that the conditions in Theorem 11.1 are independent. That is, exhibit a social
welfare function that is Pareto efficient and does not satisfy IIA or dictatoriality, and
one that satisfies IIA and is neither dictatorial nor Pareto efficient.

11.4.  Independence  of  the  Conditions  in  Theorem  11.2 
Show that the conditions in Theorem 11.2 are independent.

11.5. Independence of the Conditions in Theorem 11.3
Show that the conditions in Theorem 11.3 are independent.

11.6.  Copeland  Score  and  Kramer  Score 

The Copeland score of an alternative $a \in A$ at a profile $r = (R_1, \ldots, R_n) \in \mathcal{L}^N$ is
defined by

$$c(a, r) = |\{b \in A \mid |N(a, b, r)| > |N(b, a, r)|\}|,$$

i.e.,  the  number  of  alternatives  that  a  beats  (cf.  Problem  11.2).  The  Copeland  ranking 
is  obtained  by  ranking  the  alternatives  according  to  their  Copeland  scores. 

(a)  Is  the  Copeland  ranking  a  preference?  Is  it  antisymmetric?  Does  the  derived 
social  welfare  function  satisfy  IIA?  Pareto  efficiency? 

The Kramer score of an alternative $a \in A$ at a profile $r = (R_1, \ldots, R_n) \in \mathcal{L}^N$ is
defined by

$$k(a, r) = \min_{b \in A \setminus \{a\}} |N(a, b, r)|,$$

i.e.,  the  worst  score  among  all  pairwise  comparisons.  The  Kramer  ranking  is 
obtained  by  ranking  the  alternatives  according  to  their  Kramer  scores. 

(b)  Is  the  Kramer  ranking  a  preference?  Is  it  antisymmetric?  Does  the  derived  social 
welfare  function  satisfy  IIA?  Pareto  efficiency? 

11.7.  Two  Alternatives 

Show  that  Theorems  11.1-11.3  no  longer  hold  if  there  are  just  two  alternatives,  i.e., 
if  m  =  2. 


11.5 Notes

For a general overview of social choice theory see Arrow et al. (2002, 2011). For
Arrow’s Theorem see Arrow (1963). For the Gibbard-Satterthwaite Theorem see
Gibbard (1973) and Satterthwaite (1975). Both these theorems are closely related:
indeed, the proof of the Gibbard-Satterthwaite Theorem in Gibbard (1973) uses
Arrow’s Theorem. The presentation in this chapter closely follows that in Reny
(2001), which is both simple and elegant, and which shows the close relation
between the two results.

For the Borda scores and the so-called Borda rule see de Borda (1781). The
Muller-Satterthwaite Theorem is from Muller and Satterthwaite (1977).

De Condorcet (1785) was the first to explicitly discuss the notion of a Condorcet
winner; see Gehrlein (2006) for a comprehensive study of the so-called Condorcet
paradox in Problem 11.2(b).


Matrix Games

In this chapter we study finite two-person zero-sum games (matrix games) more
rigorously.  In  particular,  von  Neumann’s  Minimax  Theorem  is  proved.  The  chapter 
extends  Chap.  2  in  Part  I.  Although  it  is  self-contained,  it  may  be  useful  to  (re)read 
Chap.  2  first. 

Section  12.1  presents  a  proof  of  the  Minimax  Theorem,  and  Sect.  12.2  shows 
how  a  matrix  game  can  be  solved — i.e.,  optimal  strategies  and  the  value  of  the  game 
can  be  found — by  solving  an  associated  linear  programming  problem. 


12.1 The Minimax Theorem

A  two-person  zero-sum  game  is  completely  determined  by  a  single  matrix.  We 
repeat  Definition  2.1. 

Definition 12.1 (Matrix Game) A matrix game is an $m \times n$ matrix $A$ of real
numbers, where $m$ is the number of rows and $n$ is the number of columns. A (mixed)
strategy of player 1 is a probability distribution $p$ over the rows of $A$, i.e., an element
of the set

$$\Delta^m := \{p = (p_1, \ldots, p_m) \in \mathbb{R}^m \mid \sum_{i=1}^m p_i = 1,\ p_i \geq 0 \text{ for all } i = 1, \ldots, m\}.$$

Similarly, a (mixed) strategy of player 2 is a probability distribution $q$ over the
columns of $A$, i.e., an element of the set

$$\Delta^n := \{q = (q_1, \ldots, q_n) \in \mathbb{R}^n \mid \sum_{j=1}^n q_j = 1,\ q_j \geq 0 \text{ for all } j = 1, \ldots, n\}.$$

A strategy $p$ of player 1 is called pure if there is a row $i$ with $p_i = 1$. This strategy
is also denoted by $e^i$. Similarly, a strategy $q$ of player 2 is called pure if there is a
column $j$ with $q_j = 1$. This strategy is also denoted by $e^j$. □

Let $A$ be an $m \times n$ matrix game. For any strategy $p \in \Delta^m$ of player 1, let $v_1(p) =
\min_{q \in \Delta^n} pAq$. It is easy to see that $v_1(p) = \min_{j \in \{1, \ldots, n\}} pAe^j$, since $pAq$ is a convex
combination of the numbers $pAe^j$. In the matrix game $A$ player 1 can guarantee a
payoff of at least

$$v_1(A) := \max_{p \in \Delta^m} v_1(p).$$

Similarly, for any strategy $q \in \Delta^n$ of player 2 let $v_2(q) = \max_{p \in \Delta^m} pAq =
\max_{i \in \{1, \ldots, m\}} e^i Aq$; then player 2 can guarantee to have to pay at most

$$v_2(A) := \min_{q \in \Delta^n} v_2(q).$$


Intuitively,  player  1  should  not  be  able  to  guarantee  to  obtain  more  than  what  player 
2  can  guarantee  to  pay  maximally.  Indeed,  we  have  the  following  lemma. 

Lemma 12.2 For any $m \times n$ matrix game, $v_1(A) \leq v_2(A)$.

Proof  Problem  12.2.  ■ 
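
The guarantee levels $v_1(p)$ and $v_2(q)$ are easy to evaluate, because only pure replies of the opponent need to be checked. The following Python sketch illustrates this on a hypothetical $2 \times 2$ game (matching pennies) that is not taken from the text; the function names are likewise chosen only for this illustration.

```python
from fractions import Fraction as F


def v1(p, A):
    """min over columns j of p A e^j: the payoff that p guarantees player 1."""
    return min(sum(p[i] * A[i][j] for i in range(len(A)))
               for j in range(len(A[0])))


def v2(q, A):
    """max over rows i of e^i A q: the most that q can cost player 2."""
    return max(sum(A[i][j] * q[j] for j in range(len(q)))
               for i in range(len(A)))


# Hypothetical example: matching pennies.
A = [[1, -1],
     [-1, 1]]
half = [F(1, 2), F(1, 2)]

if __name__ == "__main__":
    print(v1([1, 0], A), v2([1, 0], A))   # guarantee levels of pure strategies
    print(v1(half, A), v2(half, A))       # guarantee levels of uniform mixing
```

For any pair of strategies, the first number never exceeds the second, which is the content of Lemma 12.2.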

The  following  theorem  is  due  to  von  Neumann.  The  proof  is  based  on  Lemma  22.3, 
which  is  equivalent  to  Farkas’  Lemma. 

Theorem 12.3 (Minimax Theorem for Matrix Games) For any $m \times n$ matrix
game $A$, $v_1(A) = v_2(A)$.

Proof Let $A$ be an $m \times n$ matrix game. In view of Lemma 12.2 it is sufficient to
prove that $v_1(A) \geq v_2(A)$. Suppose, to the contrary, that $v_1(A) < v_2(A)$. We derive a
contradiction, which completes the proof of the theorem.

Let $B$ be an arbitrary $m \times n$ matrix game. Then either (i) or (ii) in Lemma 22.3
has to hold for $B$, i.e., exactly one of the following holds:

(i) There are $y \in \mathbb{R}^n$ and $z \in \mathbb{R}^m$ with $(y, z) \geq 0$, $(y, z) \neq 0$ and $By + z = 0$.

(ii) There is an $x \in \mathbb{R}^m$ with $x > 0$ and $xB > 0$.

First suppose that (i) holds and let $y \in \mathbb{R}^n$ and $z \in \mathbb{R}^m$ with $(y, z) \geq 0$, $(y, z) \neq 0$
and $By + z = 0$. It cannot be the case that $y = 0$, since that would imply that also
$z = 0$, a contradiction. Hence $\sum_{k=1}^n y_k > 0$. Define $q \in \Delta^n$ by $q_j = y_j / \sum_{k=1}^n y_k$ for
every $j = 1, \ldots, n$. Then $Bq = -z / \sum_{k=1}^n y_k \leq 0$. Hence $v_2(q) \leq 0$, and therefore
$v_2(B) \leq 0$.

Suppose instead that (ii) holds. Then there is an $x \in \mathbb{R}^m$ with $x > 0$ and $xB > 0$.
Define $p \in \Delta^m$ by $p = x / \sum_{i=1}^m x_i$; then $v_1(p) > 0$ and therefore $v_1(B) > 0$.

We conclude that, for any matrix game $B$, it is not possible to have $v_1(B) \leq 0 <
v_2(B)$.

Let now $B$ be the matrix game arising by subtracting the number $v_1(A)$ from all
entries of $A$. Then, clearly, $v_1(B) = v_1(A) - v_1(A) = 0$ and $v_2(B) = v_2(A) - v_1(A) >
0$. Hence, $v_1(B) \leq 0 < v_2(B)$, which is the desired contradiction. ■

In view of Theorem 12.3 we can define the value of the game $A$ by $v(A) = v_1(A) =
v_2(A)$. An optimal strategy of player 1 is a strategy $p$ such that $v_1(p) \geq v(A)$.
Similarly, an optimal strategy of player 2 is a strategy $q$ such that $v_2(q) \leq v(A)$.
Theorem 12.3 implies that $v_1(p) = v_2(q) = v(A)$ for such optimal strategies. Thus,
if $p$ is an optimal strategy for player 1, then $v_1(p) = \max_{p' \in \Delta^m} v_1(p')$, so that $p$
is a maximin strategy (cf. Definition 2.3). Conversely, every maximin strategy is
an optimal strategy for player 1. Similarly, the optimal strategies for player 2 are
exactly the minimax strategies.

For  computation  of  optimal  strategies  and  the  value  of  matrix  games  in  some 
special  cases,  see  Chap.  2  and  Problems  12.3  and  12.4.  In  general,  matrix  games 
can  be  solved  by  linear  programming.  This  is  demonstrated  in  the  next  section. 


12.2 A Linear Programming Formulation


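Let $A$ be an $m \times n$ matrix game:

$$A = \begin{pmatrix} a_{11} & \cdots & a_{1n} \\ \vdots & & \vdots \\ a_{m1} & \cdots & a_{mn} \end{pmatrix}.$$

Adding the same number to all entries of $A$ changes the value by that same number
but not the optimal strategies of the players. So we may assume without loss of
generality that all entries of $A$ are positive. We define the $(m+1) \times (n+1)$ matrix
$B$ as follows:

$$B = \begin{pmatrix} a_{11} & \cdots & a_{1n} & -1 \\ \vdots & & \vdots & \vdots \\ a_{m1} & \cdots & a_{mn} & -1 \\ -1 & \cdots & -1 & 0 \end{pmatrix}.$$

Let $b = (0, \ldots, 0, -1) \in \mathbb{R}^{n+1}$ and $c = (0, \ldots, 0, -1) \in \mathbb{R}^{m+1}$. Define $V :=
\{x \in \mathbb{R}^{m+1} \mid xB \geq b,\ x \geq 0\}$ and $W := \{y \in \mathbb{R}^{n+1} \mid By \leq c,\ y \geq 0\}$. It is easy to
check that $V, W \neq \emptyset$. The Duality Theorem of Linear Programming (Theorem 22.6)
therefore implies: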

Corollary 12.4 $\min\{x \cdot c \mid x \in V\} = \max\{b \cdot y \mid y \in W\}$.

The minimum and maximum problems in this corollary are so-called linear
programming (LP) problems. If we call the minimization problem the primal problem,
then the maximization problem is the dual problem (or vice versa). The common
minimum/maximum is called the value of the LP, and $x$ and $y$ achieving the value
are called optimal solutions. Denote the sets of optimal solutions by $O_{\min}$ and $O_{\max}$,
respectively.

We have the following result. The proof uses Lemma 22.9, which states that if
$x \in V$ and $y \in W$ satisfy $x \cdot c = b \cdot y$, then $x$ and $y$ are optimal solutions.


Theorem 12.5 Let $A$ be an $m \times n$ matrix game with all entries positive.

(i) If $p \in \Delta^m$ is an optimal strategy for player 1 and $q \in \Delta^n$ is an optimal strategy
for player 2 in $A$, then $(p, v(A)) \in O_{\min}$ and $(q, v(A)) \in O_{\max}$. The value of the
LP is $-v(A)$.

(ii) If $x = (x_1, \ldots, x_m, x_{m+1}) \in O_{\min}$ and $y = (y_1, \ldots, y_n, y_{n+1}) \in O_{\max}$, then
$(x_1, \ldots, x_m)$ is an optimal strategy for player 1 in $A$, $(y_1, \ldots, y_n)$ is an optimal
strategy for player 2 in $A$, and $v(A) = x_{m+1} = y_{n+1}$.


Proof

(i) Let $p \in \Delta^m$ and $q \in \Delta^n$ be optimal strategies in the matrix game $A$. Then
$pAe^j \geq v(A)$ and $e^i Aq \leq v(A)$ for all $i = 1, \ldots, m$ and $j = 1, \ldots, n$. Since
all entries of $A$ are positive and therefore $v(A) > 0$, this implies $(p, v(A)) \in V$
and $(q, v(A)) \in W$. Since $(p, v(A)) \cdot c = -v(A)$ and $(q, v(A)) \cdot b = -v(A)$,
Lemma 22.9 implies that the value of the LP is $-v(A)$, $(p, v(A)) \in O_{\min}$ and
$(q, v(A)) \in O_{\max}$.

(ii) Let $x = (x_1, \ldots, x_{m+1}) \in O_{\min}$. Since $x \cdot c = -v(A)$ by (i), we have $x_{m+1} =
v(A)$. Since $xB \geq b$, we have $(x_1, \ldots, x_m)Ae^j \geq v(A)$ for all $j = 1, \ldots, n$, $x_i \geq 0$
for all $i = 1, \ldots, m$, and $\sum_{i=1}^m x_i \leq 1$. Suppose that $\sum_{i=1}^m x_i < 1$. Obviously,
$\sum_{i=1}^m x_i > 0$, since otherwise $x = (0, \ldots, 0, v(A)) \notin V$ because $v(A) > 0$. Then,
letting $t = (\sum_{i=1}^m x_i)^{-1} > 1$, we have $tx \in V$ and $tx \cdot c = -tv(A) < -v(A)$,
contradicting $x \in O_{\min}$. Hence, $\sum_{i=1}^m x_i = 1$ and $(x_1, \ldots, x_m)$ is an optimal
strategy of player 1 in $A$.

The proof of the second part of (ii) is analogous. ■

The  interest  of  this  theorem  derives  from  the  fact  that  solving  linear  programming 
problems  is  a  well  established  area.  Thus,  one  can  apply  any  (computer)  method  for 
solving  LPs  to  find  the  value  and  optimal  strategies  of  a  matrix  game. 
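
As a small illustration of how the LP formulation certifies a solution, the following Python sketch builds the matrix $B$ from a given game, checks that a candidate $(p, v)$ lies in $V$ and a candidate $(q, v)$ lies in $W$, and compares the two objective values $x \cdot c$ and $b \cdot y$. The $2 \times 2$ game and the candidate solution are hypothetical choices for this example, and the helper names are not from the text. The sketch does not solve the LP; it only verifies the optimality certificate described by Lemma 22.9.

```python
from fractions import Fraction as F


def build_B(A):
    """(m+1) x (n+1) matrix: A with an extra column and row of -1's, corner 0."""
    B = [list(row) + [-1] for row in A]
    B.append([-1] * len(A[0]) + [0])
    return B


def dot(u, w):
    return sum(ui * wi for ui, wi in zip(u, w))


def in_V(x, B, b):
    """x >= 0 and x B >= b, with x read as a row vector."""
    cols = [[row[j] for row in B] for j in range(len(B[0]))]
    return (all(xi >= 0 for xi in x)
            and all(dot(x, col) >= bj for col, bj in zip(cols, b)))


def in_W(y, B, c):
    """y >= 0 and B y <= c, with y read as a column vector."""
    return (all(yj >= 0 for yj in y)
            and all(dot(row, y) <= ci for row, ci in zip(B, c)))


# Hypothetical positive game with a candidate solution (p, q, v).
A = [[2, 1],
     [1, 3]]
p, q, v = [F(2, 3), F(1, 3)], [F(2, 3), F(1, 3)], F(5, 3)

B = build_B(A)
b = [0] * len(A[0]) + [-1]   # b in R^(n+1)
c = [0] * len(A) + [-1]      # c in R^(m+1)
x, y = p + [v], q + [v]

if __name__ == "__main__":
    print(in_V(x, B, b), in_W(y, B, c), dot(x, c), dot(b, y))
```

If both membership checks succeed and the two objective values coincide, the candidate strategies are optimal and the common objective value equals minus the value of the game.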

By slightly modifying part (ii) of the proof of Theorem 12.5, we can in fact derive
the Minimax Theorem from the Duality Theorem (Problem 12.5). Conversely,
with each LP we can associate a matrix game and thereby derive the Duality
Theorem from the Minimax Theorem. This confirms the close relationship between
linear programming (Duality Theorem) and the theory of matrix games (Minimax
Theorem).


12.3  Problems 

12.1.  Solving  a  Matrix  Game 
Consider the matrix game

$$A = \begin{pmatrix} 6 & 4 & 2 & 1 \\ 5 & 3 & 3 & 2 \\ 1 & 0 & 3 & 4 \\ 2 & -3 & 2 & 3 \end{pmatrix}.$$

(a)  Reduce  the  game  by  iterated  elimination  of  strictly  dominated  strategies. 
Describe  exactly  which  pure  strategy  you  eliminate  each  time,  and  by  which 
pure  or  mixed  strategy  the  strategy  to  be  eliminated  is  strictly  dominated.  [See 
Chap.  2,  also  for  the  following  questions.] 

Denote  the  reduced  game  derived  in  (a)  by  B. 

(b) Solve $B$ graphically. Explicitly compute $v_1(p)$ and $v_2(q)$ for strategies $p$ of
player 1 and $q$ of player 2. Determine the value of $B$ and the optimal strategy or
strategies of players 1 and 2 in $B$.

(c)  Determine  the  value  of  A  and  the  optimal  strategy  or  strategies  of  players  1  and 
2  in  A. 

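Now change the entry $a_{11}$ from 6 to $y \in \mathbb{R}$, so that we obtain the matrix game

$$A_y = \begin{pmatrix} y & 4 & 2 & 1 \\ 5 & 3 & 3 & 2 \\ 1 & 0 & 3 & 4 \\ 2 & -3 & 2 & 3 \end{pmatrix}.$$

(d) Compute $v(A_y)$ and the optimal strategies in $A_y$ for every $y \in \mathbb{R}$.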

12.2.  Proof  of  Lemma  12.2 
Prove  Lemma  12.2. 




12.3. 2 × 2 Games
Consider the 2 × 2 matrix game

$$A = \begin{pmatrix} a_{11} & a_{12} \\ a_{21} & a_{22} \end{pmatrix}.$$

Assume that $A$ has no saddlepoints. [A saddlepoint is an entry $(i, j)$ such that $a_{ij}$ is
maximal in column $j$ and minimal in row $i$, cf. Definition 2.4.]

(a) Assume that $a_{11} > a_{12}$. Show that

$$a_{12} < a_{22}, \quad a_{21} < a_{22}, \quad a_{11} > a_{21}.$$

(b) Show that the unique optimal strategies $p$ and $q$ and the value of the game are
given by:

$$p = \frac{JA^*}{JA^*J^T}, \quad q = \frac{A^*J^T}{JA^*J^T}, \quad v(A) = \frac{|A|}{JA^*J^T},$$

where $A^*$ is the adjoint matrix of $A$, i.e.,

$$A^* = \begin{pmatrix} a_{22} & -a_{12} \\ -a_{21} & a_{11} \end{pmatrix},$$

$|A|$ is the determinant of $A$, and $J := (1, 1)$.
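
The formulas in part (b) can be evaluated directly. The following Python sketch does so with exact fractions; it assumes that the input game has no saddlepoint (so that the denominator $JA^*J^T$ is nonzero), and the function name and the sample matrix are hypothetical choices for this illustration.

```python
from fractions import Fraction as F


def solve_2x2(A):
    """Optimal p, q and value of a 2 x 2 matrix game without saddlepoints."""
    (a11, a12), (a21, a22) = A
    adj = [[a22, -a12],
           [-a21, a11]]                                   # adjoint matrix A*
    JA = [adj[0][0] + adj[1][0], adj[0][1] + adj[1][1]]   # J A*   (column sums)
    AJ = [adj[0][0] + adj[0][1], adj[1][0] + adj[1][1]]   # A* J^T (row sums)
    denom = F(JA[0] + JA[1])                              # J A* J^T
    det = a11 * a22 - a12 * a21                           # |A|
    p = [value / denom for value in JA]
    q = [value / denom for value in AJ]
    return p, q, det / denom


if __name__ == "__main__":
    print(solve_2x2([[2, 1], [1, 3]]))
```

A quick sanity check on any output is that $p$ makes player 2 indifferent between the two columns and $q$ makes player 1 indifferent between the two rows, with both expected payoffs equal to the returned value.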

12.4.  Symmetric  Games 

An $m \times n$ matrix game $A = (a_{ij})$ is called symmetric if $m = n$ and $a_{ij} = -a_{ji}$ for all
$i, j = 1, \ldots, m$. Prove that the value of a symmetric game is zero and that the sets of
optimal strategies of players 1 and 2 coincide.

12.5.  The  Duality  Theorem  Implies  the  Minimax  Theorem 

Modify  the  proof  of  part  (ii)  of  Theorem  12.5  in  order  to  derive  the  Minimax 
Theorem  from  the  Duality  Theorem.  [Hint:  first  show  that  the  value  of  the  LP  must 
be  negative.] 

12.6.  Infinite  Matrix  Games 

Consider the following two-player game. Each player mentions a natural number.
The player with the higher number receives one Euro from the player with the lower
number. If the numbers are equal then no player receives anything.

(a) Write this game in the form of an infinite matrix game $A$.

(b) Compute $\sup_p \inf_q pAq$ and $\inf_q \sup_p pAq$, where $p$ and $q$ are probability
distributions over the rows and the columns of $A$, respectively. (Conclude that
this game has no ‘value’.)

Footnote 1 (to Problem 12.3): $J$ denotes the row vector and $J^T$ the transpose, i.e., the
column vector. In general, we omit the transpose notation if confusion is unlikely.

12.7.  Equalizer  Theorem 

Let $v$ be the value of the $m \times n$ matrix game $A$, and suppose that $pAe^n = v$ for every
optimal strategy $p$ of player 1. Show that player 2 has an optimal strategy $q$ with
$q_n > 0$.


12.4 Notes

The  Minimax  Theorem  was  first  proved  in  von  Neumann  (1928).  The  simplex 
algorithm  was  developed  by  George  Dantzig  in  1947;  see  Dantzig  (1963)  or  any 
textbook  on  linear  programming  or  operations  research. 

With  each  linear  programming  problem  we  can  associate  a  matrix  game  and 
thereby  derive  the  Duality  Theorem  from  the  Minimax  Theorem:  see,  e.g.,  Owen 
(1995). 

For  Problem  12.7  see  also  Raghavan  (1994). 


Finite Games

This chapter builds on Chap. 3, where we studied finite two-person games (bimatrix
games). (Re)reading Chap. 3 may serve as a good preparation for the present chapter,
which offers a more rigorous treatment of finite games, i.e., games with finitely
many players (often two) who have finitely many pure strategies over which
they can randomize. We only discuss games with complete information. In the
terminology of Chap. 5, each player has only one type.

In  Sect.  13.1  a  proof  of  Nash’s  existence  theorem  is  provided.  Section  13.2  goes 
deeper  into  bimatrix  games.  In  Sect.  13.3  the  notion  of  iterated  dominance  is  studied, 
and  its  relation  with  rationalizability  indicated.  Sections  13.4-13.6  present  some 
basics about refinements of Nash equilibrium. Section 13.7 is on correlated equilibrium
in bimatrix games, and Sect. 13.8 concludes with an axiomatic characterization
of  Nash  equilibrium  based  on  a  reduced  game  (consistency)  condition. 


13.1 Existence of Nash Equilibrium

We  start  with  a  general  definition  of  a  finite  game.  Matrix  and  bimatrix  games  are 
special  cases. 

A finite game is a $(2n + 1)$-tuple

$$G = (N, S_1, \ldots, S_n, u_1, \ldots, u_n),$$

where

• $N = \{1, \ldots, n\}$, with $n \in \mathbb{N}$, is the set of players;

• for every $i \in N$, $S_i$ is the finite pure strategy set of player $i$;

• for every $i \in N$, $u_i : S = S_1 \times \ldots \times S_n \to \mathbb{R}$ is the payoff function of player $i$; i.e.,
for every pure strategy combination $(s_1, \ldots, s_n) \in S$ where $s_1 \in S_1, \ldots, s_n \in S_n$,
$u_i(s_1, \ldots, s_n) \in \mathbb{R}$ is player $i$’s payoff.


© Springer-Verlag Berlin Heidelberg 2015

H. Peters, Game Theory, Springer Texts in Business and Economics,
DOI 10.1007/978-3-662-46950-7_13




This definition is identical to the definition of an $n$-person game in Chap. 6, except
that the pure strategy sets are now finite. The elements of $S_i$ are the pure strategies
of player $i$. A (mixed) strategy of player $i$ is a probability distribution over $S_i$. The
set of (mixed) strategies of player $i$ is denoted by $\Delta(S_i)$. Observe that, whenever we
talk about a strategy, we mean a mixed strategy (which may of course be pure).

Let $(\sigma_1, \ldots, \sigma_n) \in \Delta(S_1) \times \ldots \times \Delta(S_n)$ be a strategy combination. Player $i$’s
payoff from this strategy combination is defined to be his expected payoff. With
some abuse of notation this is also denoted by $u_i(\sigma_1, \ldots, \sigma_n)$. Formally,

$$u_i(\sigma_1, \ldots, \sigma_n) = \sum_{(s_1, \ldots, s_n) \in S} \Big( \prod_{j \in N} \sigma_j(s_j) \Big)\, u_i(s_1, \ldots, s_n).$$
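
The expected payoff formula translates directly into a sum over all pure strategy combinations. The following Python sketch is one way to write it; the representation of mixed strategies as dictionaries, the function names, and the two-player example are assumptions made for illustration and do not come from the text.

```python
from fractions import Fraction as F
from itertools import product


def expected_payoff(u_i, sigma):
    """Expected payoff of one player under the mixed strategy combination sigma.

    u_i   : function from a pure strategy combination (a tuple) to a number
    sigma : list of dicts; sigma[j][s_j] is the probability of pure strategy s_j
    """
    total = 0
    for s in product(*(list(sig) for sig in sigma)):
        prob = 1
        for j, s_j in enumerate(s):
            prob *= sigma[j][s_j]      # product over players j of sigma_j(s_j)
        total += prob * u_i(s)         # weight the pure payoff u_i(s)
    return total


# Hypothetical example: player 1 wins 1 if the choices match and loses 1 otherwise.
def u1(s):
    return 1 if s[0] == s[1] else -1


sigma = [{"H": F(1, 2), "T": F(1, 2)},
         {"H": F(1, 4), "T": F(3, 4)}]

if __name__ == "__main__":
    print(expected_payoff(u1, sigma))
```

The number of terms in the sum is the product of the sizes of the pure strategy sets, so this direct evaluation is only practical for small games.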

For a strategy combination $\sigma$ and a player $i \in N$ we denote by $(\sigma_i', \sigma_{-i})$ the strategy
combination in which player $i$ plays $\sigma_i' \in \Delta(S_i)$ and each player $j \neq i$ plays $\sigma_j$.

A best reply of player $i$ to the strategy combination $\sigma_{-i}$ of the other players is a
strategy $\sigma_i \in \Delta(S_i)$ such that $u_i(\sigma_i, \sigma_{-i}) \geq u_i(\sigma_i', \sigma_{-i})$ for all $\sigma_i' \in \Delta(S_i)$.

A Nash equilibrium of $G$ is a strategy combination $\sigma^* \in \prod_{i \in N} \Delta(S_i)$ such that
for each player $i$, $\sigma_i^*$ is a best reply to $\sigma_{-i}^*$.

As in Chaps. 3 and 6, $\beta_i$ denotes player $i$’s best reply correspondence. That is,
$\beta_i : \prod_{j \in N, j \neq i} \Delta(S_j) \to \Delta(S_i)$ assigns to each strategy combination of the other
players the set of all best replies of player $i$.

Theorem 13.1 (Existence of Nash Equilibrium) Every finite game $G =
(N, S_1, \ldots, S_n, u_1, \ldots, u_n)$ has a Nash equilibrium.


The  proof  of  this  theorem  below  is  based  on  the  Kakutani  Fixed  Point  Theorem 
(Sect.  22.5). 

Proof of Theorem 13.1 Consider the correspondence

$$\beta : \prod_{i \in N} \Delta(S_i) \to \prod_{i \in N} \Delta(S_i), \quad (\sigma_1, \ldots, \sigma_n) \mapsto \prod_{i \in N} \beta_i(\sigma_1, \ldots, \sigma_{i-1}, \sigma_{i+1}, \ldots, \sigma_n).$$

This correspondence is convex-valued and upper semi-continuous (Problem 13.2).
By the Kakutani Fixed Point Theorem (Theorem 22.11) it has a fixed point $\sigma^*$. By
definition of $\beta$, any fixed point is a Nash equilibrium of $G$. ■

An alternative proof is obtained by using the Brouwer Fixed Point Theorem
(Theorem 22.10). See Problem 13.1.


13.2 Bimatrix Games

Two-person  finite  games — bimatrix  games — were  studied  in  Chap.  3.  Here  we 
present  some  extensions.  In  Sect.  13.2.1  we  give  some  formal  relations  between  pure 
and  mixed  strategies  in  a  Nash  equilibrium.  In  Sect.  13.2.2  we  extend  the  graphical 
method  for  computing  Nash  equilibria  (cf.  Sect.  3.2.2).  In  Sect.  13.2.3  a  general 
mathematical  programming  method  is  described  by  which  equilibria  of  bimatrix 
games  can  be  found.  Section  13.2.4  reconsiders  matrix  games  as  a  special  kind  of 
bimatrix  games.  Section  13.2.5  is  about  Zermelo’s  theorem  on  the  game  of  chess. 


13.2.1 Pure and Mixed Strategies in Nash Equilibrium

Let $(A, B)$ be an $m \times n$ bimatrix game (Definition 3.1). The first lemma implies that
to determine whether a strategy pair is a Nash equilibrium it is sufficient to compare
the expected payoff of a (mixed) strategy with the payoffs of pure strategies.

Lemma 13.2 Let $p \in \Delta^m$ and $q \in \Delta^n$. Then $p \in \beta_1(q)$ if and only if $pAq \geq e^i Aq$
for all $i = 1, \ldots, m$; and $q \in \beta_2(p)$ if and only if $pBq \geq pBe^j$ for all $j = 1, \ldots, n$.

Proof Problem 13.3. ■

The next lemma says that a player always has a pure best reply against any strategy
of the opponent.

Lemma 13.3 Let $p \in \Delta^m$ and $q \in \Delta^n$. Then there is an $i \in \{1, \ldots, m\}$ with
$e^i \in \beta_1(q)$ and a $j \in \{1, \ldots, n\}$ with $e^j \in \beta_2(p)$.

Proof  Problem  13.4.  ■ 

In light of these lemmas it makes sense to introduce the pure best reply
correspondences.

Definition 13.4 Let $(A, B)$ be an $m \times n$ bimatrix game and let $p \in \Delta^m$ and $q \in \Delta^n$.
Then

$$PB_1(q) = \{i \in \{1, \ldots, m\} \mid e^i Aq = \max_k e^k Aq\}$$

is the set of pure best replies of player 1 to $q$ and

$$PB_2(p) = \{j \in \{1, \ldots, n\} \mid pBe^j = \max_k pBe^k\}$$

is the set of pure best replies of player 2 to $p$. □

Observe that, with some abuse of notation, the pure best replies in this definition are
labelled by the row and column numbers.

The carrier $C(p)$ of a mixed strategy $p \in \Delta^k$, where $k \in \mathbb{N}$, is the set of
coordinates that are positive, i.e.,

$$C(p) = \{i \in \{1, \ldots, k\} \mid p_i > 0\}.$$


The  next  lemma  formalizes  the  observation  used  already  in  Chap.  3,  namely  that 
in  a  best  reply  a  player  puts  positive  probability  only  on  those  pure  strategies  that 
maximize  his  expected  payoff  (cf.  Problem  3.8). 

Lemma 13.5 Let $(A, B)$ be an $m \times n$ bimatrix game, $p \in \Delta^m$ and $q \in \Delta^n$. Then

$$p \in \beta_1(q) \Leftrightarrow C(p) \subseteq PB_1(q)$$

and

$$q \in \beta_2(p) \Leftrightarrow C(q) \subseteq PB_2(p).$$

Proof We only show the first equivalence.

First let $p \in \beta_1(q)$, and assume $i \in C(p)$ and, contrary to what we want to prove,
that $e^i Aq < \max_k e^k Aq$. Then

$$pAq = \max_k e^k Aq = \sum_{k'=1}^m p_{k'} \max_k e^k Aq > \sum_{k=1}^m p_k\, e^k Aq = pAq,$$

where the first equality follows from Lemma 13.3. This is a contradiction, hence
$e^i Aq = \max_k e^k Aq$ and $i \in PB_1(q)$.

Next, assume that $C(p) \subseteq PB_1(q)$. Then

$$pAq = \sum_{i=1}^m p_i\, e^i Aq = \sum_{i \in C(p)} p_i\, e^i Aq = \sum_{i \in C(p)} p_i \max_k e^k Aq = \max_k e^k Aq.$$

So $pAq \geq e^i Aq$ for all $i = 1, \ldots, m$, which by Lemma 13.2 implies $p \in \beta_1(q)$. ■


The  following  corollary  is  an  immediate  consequence  of  Lemma  13.5.  It  is,  in 
principle,  helpful  to  find  Nash  equilibria  or  to  determine  whether  a  given  strategy 
combination  is  a  Nash  equilibrium.  See  Example  13.7. 

Corollary 13.6 A strategy pair $(p, q)$ is a Nash equilibrium in a bimatrix game
$(A, B)$ if and only if $C(p) \subseteq PB_1(q)$ and $C(q) \subseteq PB_2(p)$.

Example 13.7 Consider the bimatrix game

$$(A, B) = \begin{pmatrix} 1,1 & 0,1 & 0,1 & 0,1 \\ 1,1 & 1,1 & 0,1 & 0,1 \\ 1,1 & 1,1 & 1,1 & 0,1 \\ 1,1 & 1,1 & 1,1 & 1,1 \end{pmatrix}$$

and the strategies $p = (0, \frac{1}{3}, \frac{1}{3}, \frac{1}{3})$ and $q = (\frac{1}{2}, \frac{1}{2}, 0, 0)$. Since

$$Aq = \begin{pmatrix} \frac{1}{2} \\ 1 \\ 1 \\ 1 \end{pmatrix} \quad \text{and} \quad pB = (1 \ \ 1 \ \ 1 \ \ 1),$$

$PB_1(q) = \{2, 3, 4\}$ and $PB_2(p) = \{1, 2, 3, 4\}$. Since $C(p) = \{2, 3, 4\}$ and $C(q) =
\{1, 2\}$, we have $C(p) \subseteq PB_1(q)$ and $C(q) \subseteq PB_2(p)$. So Corollary 13.6 implies that
$(p, q)$ is a Nash equilibrium.
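
The test in Corollary 13.6 is purely combinatorial once the vectors $Aq$ and $pB$ are known, so it is easy to automate. The following Python sketch checks the carrier conditions for the game and strategies of Example 13.7, as that example is read here (player 1's payoff matrix is lower triangular with ones, and player 2's payoffs are all equal to 1). The function names are hypothetical and chosen for this illustration.

```python
from fractions import Fraction as F


def carrier(p):
    """Indices (starting at 1) of the coordinates of p that are positive."""
    return {i for i, pi in enumerate(p, start=1) if pi > 0}


def pure_best_replies_1(A, q):
    """PB_1(q): rows i that maximize e^i A q."""
    payoffs = [sum(a * qj for a, qj in zip(row, q)) for row in A]
    best = max(payoffs)
    return {i for i, x in enumerate(payoffs, start=1) if x == best}


def pure_best_replies_2(B, p):
    """PB_2(p): columns j that maximize p B e^j."""
    payoffs = [sum(pi * row[j] for pi, row in zip(p, B))
               for j in range(len(B[0]))]
    best = max(payoffs)
    return {j for j, x in enumerate(payoffs, start=1) if x == best}


def is_nash(A, B, p, q):
    """Carrier test: C(p) inside PB_1(q) and C(q) inside PB_2(p)."""
    return (carrier(p) <= pure_best_replies_1(A, q)
            and carrier(q) <= pure_best_replies_2(B, p))


# Payoff matrices as read from Example 13.7 (an assumption of this sketch).
A = [[1, 0, 0, 0],
     [1, 1, 0, 0],
     [1, 1, 1, 0],
     [1, 1, 1, 1]]
B = [[1] * 4 for _ in range(4)]
p = [0, F(1, 3), F(1, 3), F(1, 3)]
q = [F(1, 2), F(1, 2), 0, 0]

if __name__ == "__main__":
    print(pure_best_replies_1(A, q), pure_best_replies_2(B, p), is_nash(A, B, p, q))
```

Exact fractions are used so that the equality test against the maximum is not affected by rounding.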


13.2.2 Extension of the Graphical Method

In Sect. 3.2.2 we learnt how to solve 2 × 2 bimatrix games graphically. We now
extend this method to 2 × 3 and 3 × 2 games. For larger games it becomes impractical
or impossible to use this graphical method.

As an example consider the 2 × 3 bimatrix game

$$(A, B) = \begin{pmatrix} 2,1 & 1,0 & 1,1 \\ 2,0 & 1,1 & 0,0 \end{pmatrix}.$$

The Nash equilibria of this game are elements of the set $\Delta^2 \times \Delta^3$ of all possible
strategy combinations. This set can be represented as in Fig. 13.1.

Fig. 13.1 The set $\Delta^2 \times \Delta^3$

Here player 2 chooses a point in the triangle with vertices $e^1$, $e^2$ and $e^3$, while
player 1 chooses a point of the horizontal line segment with vertices $e^1$ and $e^2$.

In order to determine the best replies of player 1 note that

$$Aq = \begin{pmatrix} 2q_1 + q_2 + q_3 \\ 2q_1 + q_2 \end{pmatrix}.$$

As $e^1 Aq = e^2 Aq \Leftrightarrow q_3 = 0$, it follows that

$$\beta_1(q) = \begin{cases} \{e^1\} & \text{if } q_3 > 0 \\ \Delta^2 & \text{if } q_3 = 0. \end{cases}$$

This yields the best reply correspondence represented in Fig. 13.2.

Fig. 13.2 The best reply correspondence of player 1 (shaded)

Similarly,

$$pB = (p_1 \ \ p_2 \ \ p_1)$$

implies

$$\beta_2(p) = \begin{cases} \{e^2\} & \text{if } p_1 < p_2 \\ \Delta^3 & \text{if } p_1 = p_2 \\ \{q \in \Delta^3 \mid q_2 = 0\} & \text{if } p_1 > p_2. \end{cases}$$

This yields the best reply correspondence represented in Fig. 13.3.

Figure 13.4 represents the intersection of the two best reply correspondences and,
thus, the set of Nash equilibria.

Fig. 13.3 The best reply correspondence of player 2 (shaded/thick)

Fig. 13.4 The set of Nash equilibria:
$\{((1, 0), q) \mid q_2 = 0\} \cup \{(p, (1, 0, 0)) \mid 1 \geq p_1 \geq \frac{1}{2}\} \cup \{((\frac{1}{2}, \frac{1}{2}), q) \mid q_3 = 0\} \cup \{(p, (0, 1, 0)) \mid \frac{1}{2} \geq p_1 \geq 0\}$


13.2.3  A  Mathematical  Programming  Approach 

In Sect. 12.2 we have seen that matrix games can be solved by linear programming.
Nash equilibria of an $m \times n$ bimatrix game $(A, B)$ can be found by considering the
following quadratic programming problem:

$$\max_{p \in \Delta^m,\ q \in \Delta^n,\ a, b \in \mathbb{R}} f(p, q, a, b) := pAq + pBq - a - b$$

subject to $e^i Aq \leq a$ for all $i = 1, 2, \ldots, m$
$pBe^j \leq b$ for all $j = 1, 2, \ldots, n$. (13.1)

Theorem 13.8 The following two statements are equivalent:

(1) $(p, q, a, b)$ is a solution of (13.1)

(2) $(p, q)$ is a Nash equilibrium of $(A, B)$, $a = pAq$, $b = pBq$.


Proof Problem 13.9. ■
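
To see the quadratic program at work, note that for fixed $(p, q)$ the best feasible choice is $a = \max_i e^i Aq$ and $b = \max_j pBe^j$, so the objective is at most zero and equals zero exactly at Nash equilibria. The following Python sketch evaluates this best objective value for given $(p, q)$. It uses the 2 × 3 example of Sect. 13.2.2 as read here; the function names are hypothetical, and the sketch only evaluates the objective, it does not solve the program.

```python
from fractions import Fraction as F


def bilinear(p, M, q):
    """The number p M q for a row vector p and a column vector q."""
    return sum(p[i] * M[i][j] * q[j]
               for i in range(len(p)) for j in range(len(q)))


def best_feasible_objective(A, B, p, q):
    """Largest value of f(p, q, a, b) over feasible a, b for fixed (p, q).

    Feasibility requires a >= max_i e^i A q and b >= max_j p B e^j, and f
    decreases in a and b, so the smallest feasible a and b are optimal.
    """
    a = max(sum(A[i][j] * q[j] for j in range(len(q))) for i in range(len(A)))
    b = max(sum(p[i] * B[i][j] for i in range(len(p)))
            for j in range(len(B[0])))
    return bilinear(p, A, q) + bilinear(p, B, q) - a - b


# The 2 x 3 game of Sect. 13.2.2 as read here (an assumption of this sketch).
A = [[2, 1, 1],
     [2, 1, 0]]
B = [[1, 0, 1],
     [0, 1, 0]]

if __name__ == "__main__":
    half = F(1, 2)
    print(best_feasible_objective(A, B, [half, half], [half, half, 0]))
    print(best_feasible_objective(A, B, [0, 1], [0, 0, 1]))
```

The first strategy pair belongs to the set of Nash equilibria described for this game, while in the second pair both players could gain by deviating, which shows up as a strictly negative objective value.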

If $(A, B)$ is a zero-sum game, i.e., if $B = -A$, then (13.1) reduces to

$$\max_{p \in \Delta^m,\ q \in \Delta^n,\ a, b \in \mathbb{R}} -a - b$$

subject to $e^i Aq \leq a$ for all $i = 1, 2, \ldots, m$
$-pAe^j \leq b$ for all $j = 1, 2, \ldots, n$. (13.2)

Program (13.2) can be split up into two independent programs

$$\max_{q \in \Delta^n,\ a \in \mathbb{R}} -a$$

subject to $e^i Aq \leq a$ for all $i = 1, 2, \ldots, m$ (13.3)

and

$$\min_{p \in \Delta^m,\ b \in \mathbb{R}} b$$

subject to $pAe^j \geq -b$ for all $j = 1, 2, \ldots, n$. (13.4)

One can check that these problems are equivalent to the LP and its dual for matrix
games in Sect. 12.2, see Problem 13.10.


13.2.4  Matrix  Games 

Since  matrix  games  are  also  bimatrix  games,  everything  that  we  know  about 
bimatrix games is also true for matrix games. In fact, the Minimax Theorem
(Theorem 12.3) can be derived directly from the existence theorem for Nash equilibrium
(Theorem  13.1).  Moreover,  each  Nash  equilibrium  in  a  matrix  game  consists  of  a 
pair  of  optimal  (maximin  and  minimax)  strategies,  and  each  such  pair  is  a  Nash 
equilibrium.  As  a  consequence,  in  a  matrix  game,  Nash  equilibrium  strategies  are 
exchangeable — there  is  no  coordination  problem,  and  all  Nash  equilibria  result  in 
the  same  payoffs. 

All these facts are collected in the following theorem. For terminology concerning
matrix games see Chap. 12. The ‘new’ contribution of this theorem is part (2);
part (1) is added only to provide an alternative proof of the Minimax Theorem.

Theorem 13.9 Let $A$ be an $m \times n$ matrix game. Then:

(1) $v_1(A) = v_2(A)$.

(2) A pair $(p^*, q^*) \in \Delta^m \times \Delta^n$ is a Nash equilibrium of $(A, -A)$ if and only if $p^*$ is
an optimal strategy for player 1 in $A$ and $q^*$ is an optimal strategy for player 2
in $A$.
Proof

(1) In view of Lemma 12.2 it is sufficient to prove that $v_1(A) \ge v_2(A)$. Choose
$(p^*, q^*) \in \Delta^m \times \Delta^n$ to be a Nash equilibrium of $(A, -A)$, which is possible by
Theorem 13.1. Then

\[ pAq^* \le p^*Aq^* \le p^*Aq \quad \text{for all } p \in \Delta^m,\ q \in \Delta^n . \]

This implies $\max_p pAq^* \le p^*Aq$ for all $q$, hence $v_2(A) = \min_{q'} \max_p pAq' \le p^*Aq$
for all $q$. So

\[ v_2(A) \le \min_q p^*Aq \le \max_p \min_q pAq = v_1(A) . \]

(2) First, suppose that $(p^*, q^*) \in \Delta^m \times \Delta^n$ is a Nash equilibrium of $(A, -A)$. Then

\[ p^*Aq^* = \max_p pAq^* = v_2(q^*) \ge \min_q v_2(q) = v_2(A) = v(A) . \]

If $p^*$ were not optimal, then $p^*Aq < v(A)$ for some $q \in \Delta^n$, so $p^*Aq^* \le p^*Aq < v(A)$,
a contradiction. Similarly, $q^*$ must be optimal.

Conversely, suppose that $p^*$ and $q^*$ are optimal strategies. Since $pAq^* \le v(A)$ for all
$p \in \Delta^m$ and $p^*Aq \ge v(A)$ for all $q \in \Delta^n$, it follows that $p^*$ and $q^*$
are mutual best replies and, thus, $(p^*, q^*)$ is a Nash equilibrium in $(A, -A)$. ■
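
The saddle point inequalities used in this proof are easy to check for a concrete matrix game. The sketch below is an illustration under stated assumptions: it tests $pAq^* \le p^*Aq^* \le p^*Aq$ on pure strategies only, which suffices because expected payoffs are linear in each player's mixed strategy. The helper name `is_saddle_point` and the matching pennies matrix are hypothetical choices, not taken from the text.

```python
from fractions import Fraction as F

def is_saddle_point(A, p_star, q_star):
    """Check pAq* <= p*Aq* <= p*Aq for all pure (hence all mixed) p and q."""
    m, n = len(A), len(A[0])
    rows = [sum(A[i][j] * q_star[j] for j in range(n)) for i in range(m)]  # e^i A q*
    cols = [sum(p_star[i] * A[i][j] for i in range(m)) for j in range(n)]  # p* A e^j
    value = sum(p_star[i] * rows[i] for i in range(m))                     # p* A q*
    return max(rows) <= value <= min(cols), value

# Matching pennies: player 1 wins 1 on a match and loses 1 otherwise.
A = [[1, -1], [-1, 1]]
half = [F(1, 2), F(1, 2)]
ok, value = is_saddle_point(A, half, half)          # optimal pair
bad, _ = is_saddle_point(A, [F(1), F(0)], half)     # player 1 not optimal
```

When the check succeeds, the returned `value` is the value $v(A)$ of the game, in line with part (2) of the theorem.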


13.2.5 The Game of Chess: Zermelo's Theorem

One  of  the  earliest  formal  results  in  game  theory  is  Zermelo’s  Theorem  on  the  game 
of  chess.  In  this  subsection  we  provide  a  simple  proof  of  this  theorem,  based  on 
Theorem  13.9. 

The  game  of  chess  is  a  classical  example  of  a  zero-sum  game.  There  are  three 
possible  outcomes:  a  win  for  White,  a  win  for  Black,  and  a  draw.  Identifying  player 
1  with  White  and  player  2  with  Black,  we  can  associate  with  these  outcomes  the 
payoffs  (1,-1),  (—1,1),  and  (0,0),  respectively.  In  order  to  guarantee  that  the 
(extensive  form)  game  stops  after  finitely  many  moves,  we  assume  the  following 
stopping  rule:  if  the  same  configuration  on  the  chess  board  has  occurred  more  than 
twice,  the  game  ends  in  a  draw.  Since  there  are  only  finitely  many  configurations 
on  the  chess  board,  the  game  must  stop  after  finitely  many  moves.  Note  that  the 
chess  game  is  a  finite  extensive  form  game  of  perfect  information  and  therefore  it 
has  a  Nash  equilibrium  in  pure  strategies — see  Sect.  4.3.  To  be  precise,  this  is  a  pure 
strategy  Nash  equilibrium  in  the  associated  matrix  game,  where  mixed  strategies  are 
allowed  as  well. 
Theorem 13.10 (Zermelo’s Theorem) In the game of chess, either White has a
pure strategy that guarantees a win, or Black has a pure strategy that guarantees a
win, or both players have pure strategies that guarantee at least a draw.

Proof Let $A = (a_{ij})$ denote the associated matrix game, and let row $i^*$ and column
$j^*$ constitute a pure strategy Nash equilibrium. We distinguish three cases.

Case 1. $a_{i^*j^*} = 1$, i.e., White wins. By Theorem 13.9, $v(A) = 1$. Hence, White
has a pure strategy that guarantees a win, namely to play row $i^*$.

Case 2. $a_{i^*j^*} = -1$, i.e., Black wins. By Theorem 13.9, $v(A) = -1$. Hence,
Black has a pure strategy that guarantees a win, namely to play column $j^*$.

Case 3. $a_{i^*j^*} = 0$, i.e., the game ends in a draw. By Theorem 13.9, $v(A) = 0$.
Hence, both White and Black can guarantee at least a draw by playing row $i^*$
and column $j^*$, respectively. ■
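
The proof above works through the associated matrix game. As a complementary illustration only, the same trichotomy can be seen by backward induction on a finite game tree of perfect information, the technique behind the pure strategy equilibrium of Sect. 4.3. The sketch below assumes a toy tree encoding (nested lists with leaf payoffs 1, 0, or -1 for White) that is purely hypothetical; it has nothing to do with an actual chess position.

```python
def value(node, white_to_move=True):
    """Backward-induction value of a finite win/draw/loss game tree.

    A leaf is an int payoff to White (1, 0, or -1); an inner node is a list
    of child nodes, and the players alternate moves starting with White.
    """
    if isinstance(node, int):
        return node
    child_values = [value(child, not white_to_move) for child in node]
    # White maximizes the payoff, Black minimizes it.
    return max(child_values) if white_to_move else min(child_values)

def verdict(tree):
    """Translate the value into the three cases of Zermelo's Theorem."""
    return {1: "White can force a win",
            -1: "Black can force a win",
            0: "both can force at least a draw"}[value(tree)]

# Toy tree: White chooses one of three moves, then Black replies.
toy = [[1, 0], [-1, 1], [0, 0]]
```

Since every leaf has payoff 1, 0, or -1, the value of the root is one of these three numbers, which is exactly the case distinction in the proof.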


13.3 Iterated Dominance and Best Reply

A pure strategy of a player in a finite game is strictly dominated if there is another
(mixed or pure) strategy that always yields a strictly higher payoff, whatever the other
players do. Such a strategy is not played in a Nash equilibrium, and can
therefore  be  eliminated.  In  the  smaller  game  there  may  be  another  pure  strategy  of 
the  same  or  of  another  player  that  is  strictly  dominated  and  again  may  be  eliminated. 
This  way  a  game  may  be  reduced  to  a  smaller  game  for  which  it  is  easier  to 
compute  the  Nash  equilibria.  If  the  procedure  results  in  a  unique  surviving  strategy 
combination  then  the  game  is  called  dominance  solvable,  but  this  is  a  rare  exception. 

We  applied  these  ideas  before,  in  Chaps.  2  and  3.  In  this  section  we  show, 
formally,  that  by  this  procedure  of  iterated  elimination  of  strictly  dominated 
strategies  no  Nash  equilibria  of  the  original  game  are  lost,  and  no  Nash  equilibria 
are  added. 

For  iterated  elimination  of  weakly  dominated  strategies  the  situation  is  different: 
Nash  equilibria  may  be  lost,  and  the  final  result  may  depend  on  the  order  of 
elimination.  See  Problem  3.6. 

We  start  with  repeating  the  definition  of  a  strictly  dominated  strategy  for  an 
arbitrary  finite  game. 

Definition 13.11 Let $G = (N, S_1, \ldots, S_n, u_1, \ldots, u_n)$ be a finite game, $i \in N$, $s_i \in S_i$.
Strategy $s_i$ is strictly dominated by strategy $\sigma_i \in \Delta(S_i)$ if $u_i(\sigma_i, \sigma_{-i}) > u_i(s_i, \sigma_{-i})$
for all $\sigma_{-i} \in \prod_{j \ne i} \Delta(S_j)$. Strategy $s_i \in S_i$ is strictly dominated if it is strictly
dominated by some strategy $\sigma_i \in \Delta(S_i)$. □
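
The elimination procedure can be sketched in code for bimatrix games. The version below is deliberately simplified and is an assumption of this illustration: it only detects domination by another pure strategy, whereas Definition 13.11 also allows domination by mixed strategies (checking that would need a linear program). The function name and the example game are hypothetical.

```python
def iterated_strict_dominance(A, B):
    """Iteratively delete pure strategies strictly dominated by another PURE strategy.

    A[i][j] and B[i][j] are the payoffs of players 1 and 2. Returns the lists of
    surviving row and column indices. Domination by mixed strategies is not checked.
    """
    rows, cols = list(range(len(A))), list(range(len(A[0])))
    changed = True
    while changed:
        changed = False
        for r in rows:
            # Row r is eliminated if some other row is strictly better in every column.
            if any(all(A[k][j] > A[r][j] for j in cols) for k in rows if k != r):
                rows.remove(r)
                changed = True
                break
        for c in cols:
            # Column c is eliminated if some other column is strictly better in every row.
            if any(all(B[i][k] > B[i][c] for i in rows) for k in cols if k != c):
                cols.remove(c)
                changed = True
                break
    return rows, cols

# Hypothetical 2 x 3 game: R is dominated by C, then D by U, then L by C.
A = [[1, 1, 0], [0, 0, 2]]
B = [[0, 2, 1], [3, 1, 0]]
survivors = iterated_strict_dominance(A, B)
```

In this example a single strategy combination survives, so the game is dominance solvable in the sense described above.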

The  fact  that  iterated  elimination  of  strictly  dominated  strategies  does  not  essentially 
change  the  set  of  Nash  equilibria  of  a  game  is  a  straightforward  consequence  of  the 
following  lemma. 
Lemma 13.12 Let $G = (N, S_1, \ldots, S_n, u_1, \ldots, u_n)$ be a finite game, $i \in N$,
and let $s_i \in S_i$ be strictly dominated. Let $G' = (N, S_1, \ldots, S_{i-1}, S_i \setminus \{s_i\}, S_{i+1}, \ldots, S_n, u'_1, \ldots, u'_n)$
be the game arising from $G$ by eliminating $s_i$ from $S_i$ and restricting the payoff
functions accordingly. Then:

(1) If $\sigma$ is a Nash equilibrium in $G$, then $\sigma_i(s_i) = 0$ (where $\sigma_i(s_i)$ is the probability
assigned by $\sigma_i$ to pure strategy $s_i \in S_i$) and $\sigma'$ is a Nash equilibrium in $G'$,
where $\sigma'_j = \sigma_j$ for each $j \in N \setminus \{i\}$ and $\sigma'_i$ is the restriction of $\sigma_i$ to $S_i \setminus \{s_i\}$.

(2) If $\sigma'$ is a Nash equilibrium in $G'$, then $\sigma$ is a Nash equilibrium in $G$, where
$\sigma_j = \sigma'_j$ for each $j \in N \setminus \{i\}$ and $\sigma_i(t_i) = \sigma'_i(t_i)$ for all $t_i \in S_i \setminus \{s_i\}$.

Proof

(1) Let $\sigma$ be a Nash equilibrium in $G$, and let $\tau_i \in \Delta(S_i)$ strictly dominate $s_i$. If
$\sigma_i(s_i) > 0$, then

\[ u_i(\hat{\sigma}_i + \sigma_i(s_i)\tau_i, \sigma_{-i}) > u_i(\sigma_i, \sigma_{-i}) , \]

where $\hat{\sigma}_i : S_i \to \mathbb{R}$ is defined by $\hat{\sigma}_i(t_i) = \sigma_i(t_i)$ for all $t_i \in S_i \setminus \{s_i\}$ and $\hat{\sigma}_i(s_i) = 0$.
This contradicts the assumption that $\sigma$ is a Nash equilibrium in $G$. Therefore,
$\sigma_i(s_i) = 0$. With $\sigma'$ and $G'$ as above, we have

\[ u'_i(\sigma'_1, \ldots, \sigma'_{i-1}, \tau'_i, \sigma'_{i+1}, \ldots, \sigma'_n) = u_i(\sigma_1, \ldots, \sigma_{i-1}, \bar{\tau}_i, \sigma_{i+1}, \ldots, \sigma_n) , \]

for every $\tau'_i \in \Delta(S_i \setminus \{s_i\})$, where $\bar{\tau}_i \in \Delta(S_i)$ assigns 0 to $s_i$ and is equal
to $\tau'_i$ otherwise. From this it follows that $\sigma'_i$ is still a best reply to $\sigma'_{-i}$. It is
straightforward that also for each $j \ne i$, $\sigma'_j$ is still a best reply to $\sigma'_{-j}$. Hence $\sigma'$
is a Nash equilibrium in $G'$.

(2) Let $\sigma'$ and $\sigma$ be as in (2) of the lemma. Obviously, for every player $j \ne i$, $\sigma_j$ is
still a best reply in $\sigma$ since $\sigma_i(s_i) = 0$, i.e., player $i$ puts zero probability on the
new pure strategy $s_i$. For player $i$, $\sigma_i$ is certainly a best reply among all strategies
that put zero probability on $s_i$. But then, $\sigma_i$ is a best reply among all strategies,
since strategies that put nonzero probability on $s_i$ can never be best replies by
the first argument in the proof of (1). Hence, $\sigma$ is a Nash equilibrium in $G$. ■


Obviously, a strictly dominated pure strategy is not only never played in a Nash
equilibrium, but, a fortiori, is never (part of) a best reply. Formally, we say that
a pure strategy $s_i$ of player $i$ in the finite game $G = (N, S_1, \ldots, S_n, u_1, \ldots, u_n)$ is
never a best reply if for all $(\sigma_j)_{j \ne i}$ and all $\sigma_i \in \beta_i((\sigma_j)_{j \ne i})$, we have $\sigma_i(s_i) = 0$. The
following result shows that for two-player games also the converse holds.


Theorem  13.13  In  a  finite  two-person  game  every  pure  strategy  that  is  never  a  best 
reply,  is  strictly  dominated. 
Proof Let $(A, B)$ be an $m \times n$ bimatrix game and suppose without loss of generality
that pure strategy $e^1 \in \Delta^m$ of player 1 is never a best reply. Let
$b = (-1, -1, \ldots, -1) \in \mathbb{R}^n$.

Let $\tilde{A}$ be the $(m-1) \times n$ matrix with $i$-th row equal to the first row of $A$ minus the
$(i+1)$-th row of $A$, i.e., $\tilde{a}_{ij} = a_{1j} - a_{i+1,j}$ for every $i = 1, \ldots, m-1$ and $j = 1, \ldots, n$.
Thus,

\[
\tilde{A} =
\begin{pmatrix}
a_{11} - a_{21} & \cdots & a_{1n} - a_{2n} \\
a_{11} - a_{31} & \cdots & a_{1n} - a_{3n} \\
\vdots & & \vdots \\
a_{11} - a_{m1} & \cdots & a_{1n} - a_{mn}
\end{pmatrix} .
\]

The assumption that the pure strategy $e^1$ of player 1 is never a best reply is equivalent
to the statement that the system

\[
\tilde{A} q =
\begin{pmatrix}
e^1 A q - e^2 A q \\
\vdots \\
e^1 A q - e^m A q
\end{pmatrix}
\ge 0, \quad q \in \Delta^n
\]

has no solution. This, in turn, is equivalent to the statement that the system

\[ \tilde{A} q \ge 0, \quad q \ge 0, \quad q \cdot b < 0 \]

has no solution. This means that the system in (2) of Lemma 22.7 (with $\tilde{A}$ instead of
$A$ there) has no solution. Hence, this lemma implies that the system

\[ x \in \mathbb{R}^{m-1}, \quad x \tilde{A} \le b, \quad x \ge 0 \]

has a solution. By definition of $b$ and $\tilde{A}$ we have for such a solution $x = (x_2, \ldots, x_m)$:

\[ x \ge 0 \quad \text{and} \quad \sum_{i=2}^{m} x_i e^i A \ge \sum_{i=2}^{m} x_i e^1 A + (1, \ldots, 1) . \]

This implies that $x \ne 0$ and therefore that $e^1$ is strictly dominated by the strategy

\[ \Bigl( 0,\ x_2 \Big/ \sum_{i=2}^{m} x_i,\ \ldots,\ x_m \Big/ \sum_{i=2}^{m} x_i \Bigr) \in \Delta^m . \]

Hence, $e^1$ is strictly dominated. ■


For games with more than two players Theorem 13.13 does not hold, see Problem 13.13
for a counterexample.
The concept of ‘never a best reply’ is closely related to the concept of rationalizability.
Roughly, rationalizable strategies are strategies that survive a process
of iterated elimination of strategies that are never a best reply. Just like the strategies
surviving iterated elimination of strictly dominated strategies, rationalizable
strategies constitute a set-valued solution concept. The above theorem implies that
for two-player games the two solution concepts coincide. In general, the set of
rationalizable strategies is a subset of the set of strategies that survive iterated
elimination of strictly dominated strategies.

The  implicit  assumption  justifying  iterated  elimination  of  strategies  that  are 
dominated  or  never  a  best  reply  is  quite  demanding.  Not  only  should  a  player  believe 
that some other player will not play such a strategy, but he should also believe that
the  other  player  believes  that  he  (the  first  player)  believes  this  and,  in  turn,  will  not 
use  such  a  strategy  in  the  reduced  game,  and  so  on  and  so  forth.  See  the  notes  at  the 
end  of  the  chapter  for  references. 


13.4 Perfect Equilibrium

Since a game may have many, quite different Nash equilibria, the literature has
long focused on refinements of Nash equilibrium. We have seen
examples  of  this  in  extensive  form  games,  such  as  subgame  perfect  equilibrium 
and  perfect  Bayesian  equilibrium  (Chaps.  4  and  5).  One  of  the  earliest  and  best 
known  refinements  of  Nash  equilibrium  in  strategic  form  games  is  the  concept  of 
‘trembling  hand  perfection’.  This  refinement  excludes  Nash  equilibria  that  are  not 
robust  against  ‘trembles’  in  the  players’  strategies. 

Formally, let $G = (N, S_1, \ldots, S_n, u_1, \ldots, u_n)$ be a finite game and let $\mu$ be an
error function, assigning a number $\mu_{ih} \in (0, 1)$ to every $i \in N$ and $h \in S_i$, such that
$\sum_{h \in S_i} \mu_{ih} < 1$ for every player $i$. The number $\mu_{ih}$ is the minimum probability with
which player $i$ is going to play pure strategy $h$, perhaps by ‘mistake’ (‘trembling
hand’). Let, for each $i \in N$, $\Delta(S_i, \mu) = \{\sigma_i \in \Delta(S_i) \mid \sigma_i(h) \ge \mu_{ih} \text{ for all } h \in S_i\}$,
and let $G(\mu)$ denote the game derived from $G$ by assuming that each player $i$ may
only choose strategies from $\Delta(S_i, \mu)$. The game $G(\mu)$ is called the $\mu$-perturbed
game. Denote the set of Nash equilibria of $G$ by $NE(G)$ and of $G(\mu)$ by $NE(G(\mu))$.

Lemma 13.14 For every error function $\mu$, $NE(G(\mu)) \ne \emptyset$.

Proof  Analogous  to  the  proof  of  Theorem  13.1.  ■ 

A  perfect  equilibrium  is  a  strategy  combination  that  is  the  limit  of  some  sequence  of 
Nash  equilibria  of  perturbed  games.  Formally: 

Definition 13.15 A strategy combination $\sigma$ is a perfect equilibrium if there is a
sequence $G(\mu^t)$, $t \in \mathbb{N}$, of perturbed games with $\mu^t \to 0$ for $t \to \infty$ and a sequence
of Nash equilibria $\sigma^t \in NE(G(\mu^t))$ such that $\sigma^t \to \sigma$ for $t \to \infty$. □

As will follow from Theorem 13.17 below, a perfect equilibrium is a Nash
equilibrium. So the expressions perfect equilibrium and perfect Nash equilibrium
are equivalent.

Call a strategy combination $\sigma$ in $G$ completely mixed if $\sigma_i(h) > 0$ for all $i \in N$
and $h \in S_i$.

Lemma  13.16  A  completely  mixed  Nash  equilibrium  of  G  is  a  perfect  equilibrium. 

Proof  Problem  13.14.  ■ 

Also  if  a  game  has  no  completely  mixed  Nash  equilibrium,  it  still  has  a  perfect 
equilibrium. 

Theorem 13.17 Every finite game $G = (N, S_1, \ldots, S_n, u_1, \ldots, u_n)$ has a perfect
equilibrium. Every perfect equilibrium is a Nash equilibrium.

Proof Take any sequence $(G(\mu^t))_{\mu^t \to 0}$ of perturbed games and $\sigma^t \in NE(G(\mu^t))$
for each $t \in \mathbb{N}$. Since $\prod_{i \in N} \Delta(S_i)$ is a compact set we may assume without loss of
generality that the sequence $(\sigma^t)_{t \in \mathbb{N}}$ converges to some $\sigma \in \prod_{i \in N} \Delta(S_i)$. Then $\sigma$ is
perfect. It is easy to verify that $\sigma \in NE(G)$. ■

Example 13.18 Consider the bimatrix game

\[
G = \begin{array}{c|ccc}
 & L & C & R \\ \hline
U & 1,1 & 0,0 & 2,0 \\
M & 1,2 & 1,2 & 1,1 \\
D & 0,0 & 1,1 & 1,1
\end{array} .
\]

(Cf. Problem 3.6.) The set of Nash equilibria in this game is the union of the
following sets (Problem 13.8):

(i) $\{((p, 0, 1-p), (0, \tfrac12, \tfrac12)) \mid 0 \le p \le \tfrac12\}$,

(ii) $\{((0, 0, 1), (0, q, 1-q)) \mid \tfrac12 \le q \le 1\}$,

(iii) $\{((p, 1-p, 0), (1, 0, 0)) \mid 0 \le p \le 1\}$,

(iv) $\{((0, 1, 0), (q, 1-q, 0)) \mid 0 \le q \le 1\}$,

(v) $\{((0, p, 1-p), (0, 1, 0)) \mid 0 \le p \le 1\}$.

We consider these collections one by one. The Nash equilibria in the first collection
are not perfect, for the following reason. In any perturbed game, player 1 plays each
pure strategy with positive probability. As a consequence, for player 2, R always
gives a strictly lower expected payoff than C: if player 1 plays $p = (p_1, p_2, p_3) > 0$,
then the payoff from C is $2p_2 + p_3$, and hence strictly larger than $p_2 + p_3$, which is
the payoff from R. Thus, any best reply of player 2 in a perturbed game puts minimal
probability on R, so that the strategy $(0, \tfrac12, \tfrac12)$ can never be the limit of strategies of
player 2 in Nash equilibria of perturbed games. In fact, this argument anticipates
Theorem 13.21, which states that weakly dominated strategies (the third column in
this case) are never played with positive probability in a perfect equilibrium. For
a similar reason, the equilibria in the second collection are not perfect: D gives
player 1 a strictly lower payoff than M if player 2 plays each column with positive
probability, as is the case in a perturbed game. Hence, player 1’s strategy $(0, 0, 1)$
cannot occur as the limit of Nash equilibrium strategies in perturbed games.

Now consider the collection in (iii). We will show that all Nash equilibria in this
collection are perfect. For every $\varepsilon > 0$ let $G(\varepsilon)$ be the perturbed game with $\mu_{ij} = \varepsilon$
for each player $i = 1, 2$ and each row/column $j = 1, 2, 3$. First, suppose $p = 1$ and
consider the strategy combination $((1 - 2\varepsilon, \varepsilon, \varepsilon), (1 - 2\varepsilon, \varepsilon, \varepsilon))$ in $G(\varepsilon)$. Given the
strategy of player 2, the three pure strategies (rows) of player 1 result in the payoffs
$1$, $1$, and $2\varepsilon$, respectively, so that player 1’s strategy $(1 - 2\varepsilon, \varepsilon, \varepsilon)$ is a best reply:
the inferior row D is only played with the minimally required probability. Similarly,
given this strategy of player 1, the three pure strategies (columns) of player 2 yield
$1$, $3\varepsilon$, and $2\varepsilon$, so that player 2’s strategy $(1 - 2\varepsilon, \varepsilon, \varepsilon)$ is a best reply: it puts only
the minimally required probability on the inferior columns C and R. Hence,
$((1 - 2\varepsilon, \varepsilon, \varepsilon), (1 - 2\varepsilon, \varepsilon, \varepsilon))$ is a Nash equilibrium in $G(\varepsilon)$. In fact, these arguments
anticipate Lemma 13.19, in particular part (2). Since
$((1 - 2\varepsilon, \varepsilon, \varepsilon), (1 - 2\varepsilon, \varepsilon, \varepsilon)) \to ((1, 0, 0), (1, 0, 0))$ for $\varepsilon \to 0$, we conclude that
$((1, 0, 0), (1, 0, 0))$ is a perfect equilibrium.

For the other cases in (iii) the arguments are similar. For $p = 0$, take the strategy
combination $((\varepsilon, 1 - 2\varepsilon, \varepsilon), (1 - 2\varepsilon, \varepsilon, \varepsilon))$ in $G(\varepsilon)$. Given the strategy of player 2
the three rows yield $1$, $1$, and $2\varepsilon$ as before, so that player 1’s strategy is a best reply.
Given the strategy of player 1 the three columns now yield $2 - 3\varepsilon$, $2 - 3\varepsilon$, and $1 - \varepsilon$,
so that player 2 still has a best reply, putting only the minimally required probability
on the inferior R. Thus, $((\varepsilon, 1 - 2\varepsilon, \varepsilon), (1 - 2\varepsilon, \varepsilon, \varepsilon))$ is a Nash equilibrium in $G(\varepsilon)$,
converging to $((0, 1, 0), (1, 0, 0))$ as $\varepsilon$ goes to zero. Hence, $((0, 1, 0), (1, 0, 0))$ is
a perfect equilibrium. Finally, for $0 < p < 1$, consider the strategy combination
$((p - \tfrac{\varepsilon}{2}, 1 - p - \tfrac{\varepsilon}{2}, \varepsilon), (1 - 2\varepsilon, \varepsilon, \varepsilon))$ in $G(\varepsilon)$. Player 1 still plays a best reply, as
before. For player 2, the three columns yield $2 - p - \tfrac{3\varepsilon}{2}$, $2 - 2p$, and $1 - p + \tfrac{\varepsilon}{2}$. Since
player 2’s strategy puts only the minimally required probability on C and R, it is a
best reply. Thus, the strategy combination $((p - \tfrac{\varepsilon}{2}, 1 - p - \tfrac{\varepsilon}{2}, \varepsilon), (1 - 2\varepsilon, \varepsilon, \varepsilon))$ is a
Nash equilibrium in $G(\varepsilon)$, implying that its limit $((p, 1 - p, 0), (1, 0, 0))$ is perfect.

Also  the  Nash  equilibria  in  (iv)  are  perfect:  the  arguments  are  analogous  to  those 
for the collection in (iii) (Problem 13.8). In collection (v), finally, if $1 - p > 0$,
then the Nash equilibrium cannot be perfect since D always gives a strictly lower
payoff than M in any perturbed game, and therefore should be played with minimal
probability, hence zero in the limit. The Nash equilibrium $((0, 1, 0), (0, 1, 0))$ is also
an element of the collection in (iv), and thus perfect. □


Whenever needed we assume that $\varepsilon$ is sufficiently small, which is without loss of generality since
we consider the limit for $\varepsilon \to 0$. To comply with our definitions we may take values $\varepsilon = \varepsilon^t = 1/t$,
$t \in \mathbb{N}$, but this is not essential.
The following lemma formalizes some of the arguments already used in Example 13.18.
It relates the Nash equilibria of a perturbed game to pure best replies in
such a game. The first part says that, in a Nash equilibrium of a perturbed game, if a
player puts more than the minimally required probability on some pure strategy, then
that pure strategy must be a best reply. The second part says that, in some strategy
combination, if all players only put more than the minimally required probabilities
on pure best replies, then that combination must be a Nash equilibrium.

Lemma 13.19 Let $G(\mu)$ be a perturbed game and let $\sigma$ be a strategy combination in
$G(\mu)$.

(1) If $\sigma \in NE(G(\mu))$, $i \in N$, $h \in S_i$, and $\sigma_i(h) > \mu_{ih}$, then $h \in \beta_i(\sigma_{-i})$.

(2) If for all $i \in N$ and $h \in S_i$, $\sigma_i(h) > \mu_{ih}$ implies $h \in \beta_i(\sigma_{-i})$, then $\sigma \in NE(G(\mu))$.

Proof

(1) Let $\sigma \in NE(G(\mu))$, $i \in N$, $h \in S_i$, and $\sigma_i(h) > \mu_{ih}$. Suppose, contrary to what
we wish to prove, that $h \notin \beta_i(\sigma_{-i})$. Take $h' \in S_i$ with $h' \in \beta_i(\sigma_{-i})$. (Such an $h'$
exists by an argument similar to the proof of Lemma 13.3.) Consider the strategy
$\sigma'_i$ defined by $\sigma'_i(h) = \mu_{ih}$, $\sigma'_i(h') = \sigma_i(h') + \sigma_i(h) - \mu_{ih}$, and $\sigma'_i(k) = \sigma_i(k)$ for
all $k \in S_i \setminus \{h, h'\}$. Then $\sigma'_i \in \Delta(S_i, \mu)$ and $u_i(\sigma'_i, \sigma_{-i}) > u_i(\sigma)$, contradicting
the assumption $\sigma \in NE(G(\mu))$.

(2) Let $i \in N$. The condition in (2) implies that, if $h \in S_i$ and $h \notin \beta_i(\sigma_{-i})$, then
$\sigma_i(h) = \mu_{ih}$. This implies that $\sigma_i$ is a best reply to $\sigma_{-i}$. Thus, $\sigma \in NE(G(\mu))$. ■
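
Taken together, the two parts of Lemma 13.19 give a simple test for Nash equilibrium in a perturbed game. The following sketch applies it to the game of Example 13.18 with all minimum probabilities equal to `eps`. It is an illustration under assumptions: the function names are hypothetical, `eps = 1/100` is an arbitrary small value, and the second strategy pair is a made-up approximation of collection (i).

```python
from fractions import Fraction as F

A = [[1, 0, 2], [1, 1, 1], [0, 1, 1]]  # payoffs of player 1 (rows U, M, D)
B = [[1, 0, 0], [2, 2, 1], [0, 1, 1]]  # payoffs of player 2 (columns L, C, R)

def pure_payoffs(A, B, p, q):
    """Expected payoffs of each row against q and of each column against p."""
    rows = [sum(A[i][j] * q[j] for j in range(3)) for i in range(3)]
    cols = [sum(p[i] * B[i][j] for i in range(3)) for j in range(3)]
    return rows, cols

def is_perturbed_nash(A, B, p, q, eps):
    """Lemma 13.19 test with mu_ih = eps: above-minimum weight only on pure best replies."""
    rows, cols = pure_payoffs(A, B, p, q)
    feasible = all(x >= eps for x in p + q)  # strategies lie in the perturbed game
    rows_ok = all(p[i] <= eps or rows[i] == max(rows) for i in range(3))
    cols_ok = all(q[j] <= eps or cols[j] == max(cols) for j in range(3))
    return feasible and rows_ok and cols_ok

eps = F(1, 100)
near_UL = [1 - 2 * eps, eps, eps]
near_M = [eps, 1 - 2 * eps, eps]
case_p1 = is_perturbed_nash(A, B, near_UL, near_UL, eps)  # approximates ((1,0,0),(1,0,0))
case_p0 = is_perturbed_nash(A, B, near_M, near_UL, eps)   # approximates ((0,1,0),(1,0,0))
# Made-up approximation of collection (i): player 2 puts large weight on R.
case_i = is_perturbed_nash(A, B, [F(1, 4), eps, F(3, 4) - eps],
                           [eps, (1 - eps) / 2, (1 - eps) / 2], eps)
```

The first two checks succeed, matching the computations in Example 13.18, while the third fails because column R carries more than the minimum probability without being a best reply.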


Below we present two characterizations of perfect Nash equilibrium that both avoid
sequences of perturbed games. The first one is based on the notion of $\varepsilon$-perfect
equilibrium, defined as follows. Let $\varepsilon > 0$. A strategy combination $\sigma \in \prod_{i \in N} \Delta(S_i)$
is an $\varepsilon$-perfect equilibrium of $G$ if it is completely mixed and $\sigma_i(h) \le \varepsilon$ for all $i \in N$
and all $h \in S_i$ with $h \notin \beta_i(\sigma_{-i})$.

An $\varepsilon$-perfect equilibrium of $G$ need not be a Nash equilibrium of $G$, but it puts
probabilities of at most $\varepsilon$ on pure strategies that are not best replies.

The announced characterizations are collected in the following theorem. The
theorem says that a perfect equilibrium is a limit of $\varepsilon$-perfect equilibria. Also, a
perfect equilibrium is a limit of completely mixed strategy combinations such that
the strategy of each player in the perfect equilibrium under consideration is a best
reply to the strategy combinations of the other players in those completely mixed
strategy combinations.

Theorem 13.20 Let $G = (N, S_1, \ldots, S_n, u_1, \ldots, u_n)$ and $\sigma \in \prod_{i \in N} \Delta(S_i)$. The
following statements are equivalent:

(1) $\sigma$ is a perfect equilibrium of $G$;

(2) $\sigma$ is a limit of a sequence of $\varepsilon$-perfect equilibria $\sigma^\varepsilon$ of $G$ for $\varepsilon \to 0$;

(3) $\sigma$ is a limit of a sequence of completely mixed strategy combinations $\sigma^\varepsilon$ for
$\varepsilon \to 0$, where $\sigma_i \in \beta_i(\sigma^\varepsilon_{-i})$ for each $i \in N$ and each $\sigma^\varepsilon$ in this sequence.


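Proof (1) $\Rightarrow$ (2): Take a sequence of perturbed games $G(\mu^t)$, $t \in \mathbb{N}$, with $\mu^t \to 0$
and a sequence $\sigma^t \in NE(G(\mu^t))$ with $\sigma^t \to \sigma$. For each $t$ define $\varepsilon^t \in \mathbb{R}$ by
$\varepsilon^t = \max\{\mu^t_{ih} \mid i \in N,\ h \in S_i\}$. Then, by Lemma 13.19(1), $\sigma^t$ is an $\varepsilon^t$-perfect equilibrium
for every $t$. So (2) follows.

(2) $\Rightarrow$ (3): Take a sequence of $\varepsilon$-perfect equilibria $\sigma^\varepsilon$ as in (2) converging to $\sigma$ for
$\varepsilon \to 0$. Let $i \in N$. By the definition of $\varepsilon$-perfect equilibrium, if $\sigma_i(h) > 0$ for some
$h \in S_i$, then for $\varepsilon$ sufficiently small we have $h \in \beta_i(\sigma^\varepsilon_{-i})$. This implies $\sigma_i \in \beta_i(\sigma^\varepsilon_{-i})$,
and (3) follows.

(3) $\Rightarrow$ (1): Let $\sigma^{\varepsilon^t}$, $t \in \mathbb{N}$, be a sequence as in (3) with $\varepsilon^t \to 0$ and $\sigma^{\varepsilon^t} \to \sigma$ as
$t \to \infty$. For each $t \in \mathbb{N}$, $i \in N$ and $h \in S_i$ define $\mu^t_{ih} = \sigma^{\varepsilon^t}_i(h)$ if $\sigma_i(h) = 0$ and
$\mu^t_{ih} = \varepsilon^t$ otherwise. Then, for $t$ sufficiently large, $\mu^t$ is an error function, $G(\mu^t)$ is
a perturbed game, and $\sigma^{\varepsilon^t}$ is a strategy combination in $G(\mu^t)$. By Lemma 13.19(2),
$\sigma^{\varepsilon^t} \in NE(G(\mu^t))$. So $\sigma$ is a perfect Nash equilibrium of $G$. ■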


There  is  a  close  relation  between  the  concept  of  domination  and  the  concept  of 
perfection.  We  first  extend  the  concept  of  (weak)  domination  to  mixed  strategies. 
In the game $G = (N, S_1, \ldots, S_n, u_1, \ldots, u_n)$, call a strategy $\sigma_i \in \Delta(S_i)$ (weakly)
dominated by $\sigma'_i \in \Delta(S_i)$ if $u_i(\sigma_i, \sigma_{-i}) \le u_i(\sigma'_i, \sigma_{-i})$ for all $\sigma_{-i} \in \prod_{j \ne i} \Delta(S_j)$, with
at least one inequality strict. (Observe that it is actually sufficient to check this for
combinations $s_{-i} \in \prod_{j \ne i} S_j$.) Call $\sigma_i$ undominated if there is no $\sigma'_i$ by which it is
dominated, and call a strategy combination $\sigma$ undominated if $\sigma_i$ is undominated for
every $i \in N$. We now have:


Theorem  13.21  Every  perfect  Nash  equilibrium  in  G  is  undominated. 

Proof Let $\sigma$ be a perfect Nash equilibrium and suppose that (say) $\sigma_1$ is dominated.
Then there is a $\sigma'_1 \in \Delta(S_1)$ such that $u_1(\sigma_1, s_{-1}) \le u_1(\sigma'_1, s_{-1})$ for all $s_{-1} \in \prod_{j=2}^{n} S_j$,
with at least one inequality strict. Take a sequence $(\sigma^t)_{t \in \mathbb{N}}$ of strategy combinations
as in (3) of Theorem 13.20, converging to $\sigma$. Then, since every $\sigma^t$ is completely
mixed, we have $u_1(\sigma_1, \sigma^t_{-1}) < u_1(\sigma'_1, \sigma^t_{-1})$ for every $t$. This contradicts the fact that
$\sigma_1$ is a best reply to $\sigma^t_{-1}$. ■

The converse of Theorem 13.21 is only true for two-person games. For a counterexample
involving three players, see Problem 13.15.

For  proving  the  converse  of  the  theorem  for  bimatrix  games,  we  use  the  following 
auxiliary  lemma.  In  this  lemma,  for  a  matrix  game  A,  C2(A)  denotes  the  set  of  all 
columns  of  A  that  are  in  the  carrier  of  some  optimal  strategy  of  player  2  in  A. 

Lemma 13.22 Let $G = (A, B)$ be an $m \times n$ bimatrix game and let $p \in \Delta^m$. Define
the $m \times n$ matrix $\bar{A} = (\bar{a}_{ij})$ by $\bar{a}_{ij} = a_{ij} - pAe^j$ for all $i = 1, \ldots, m$ and $j = 1, \ldots, n$.
Then $p$ is undominated in $G$ if and only if $v(\bar{A}) = 0$ and $C_2(\bar{A}) = \{1, \ldots, n\}$.

Proof First note that $p\bar{A} = 0$ and therefore $v(\bar{A}) \ge 0$.

For the if-direction, suppose that $p$ is dominated in $G$, say by $p'$. Then $p'A \gneq pA$,
hence $p'\bar{A} \gneq 0$. Therefore, if $v(\bar{A}) = 0$, then $p'$ is an optimal strategy in $\bar{A}$; take a
column $j$ with $p'\bar{A}e^j > 0$, then we have $j \notin C_2(\bar{A})$ and, thus, $C_2(\bar{A}) \ne \{1, \ldots, n\}$.
This proves the if-direction.

For the only-if direction suppose that $p$ is undominated in $G$. Suppose we had
$v(\bar{A}) > 0$. Then take an optimal strategy $\tilde{p}$ of player 1 in $\bar{A}$, so that $\tilde{p}\bar{A} > 0$, hence
$\tilde{p}A > pA$, a contradiction. Thus, $v(\bar{A}) = 0$. Suppose there is a column $j$ that is not
an element of $C_2(\bar{A})$. By Problem 12.7 there must be an optimal strategy $p'$ of player
1 in $\bar{A}$ such that $p'\bar{A}e^j > 0$, so that $p'\bar{A} \gneq 0$, hence $p'A \gneq pA$. So $p'$ dominates $p$ in
$G$, a contradiction. This proves the only-if direction. ■

Theorem 13.23 Let $G = (A, B)$ be a bimatrix game, and let $(p, q)$ be an
undominated Nash equilibrium. Then $(p, q)$ is perfect.

Proof Let $\bar{A}$ be as in Lemma 13.22, then $p$ is an optimal strategy for player 1 in $\bar{A}$ since
$p\bar{A} = 0$ and $v(\bar{A}) = 0$. By Lemma 13.22 we can find a completely mixed optimal
strategy $q'$ for player 2 in $\bar{A}$. So $p$ is a best reply to $q'$ in $\bar{A}$, i.e., $p\bar{A}q' \ge \tilde{p}\bar{A}q'$ for
all $\tilde{p}$, and thus $\tilde{p}Aq' - pAq' \le 0$ for all $\tilde{p}$. So $p$ is also a best reply to $q'$ in $G$.
For $1 > \varepsilon > 0$ define $q^\varepsilon = (1 - \varepsilon)q + \varepsilon q'$. Then $q^\varepsilon$ is completely mixed, $p$ is
a best reply to $q^\varepsilon$, and $q^\varepsilon \to q$ for $\varepsilon \to 0$. In the same way we can construct a
sequence $p^\varepsilon$ with analogous properties, converging to $p$. Then implication (3) $\Rightarrow$
(1) in Theorem 13.20 implies that $(p, q)$ is perfect. ■

Example  13.24  Consider  again  the  game  G  from  Example  13.18.  Row  D  is  weakly 
dominated  by  M,  and  R  by  C.  There  are  no  other  weakly  dominated  strategies  in 
this  game.  Thus,  as  established  earlier,  but  now  using  Theorems  13.21  and  13.23, 
the set of perfect equilibria is the set $\{((p, 1-p, 0), (1, 0, 0)) \mid 0 \le p \le 1\} \cup
\{((0, 1, 0), (q, 1-q, 0)) \mid 0 \le q \le 1\}$. □
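
The domination claims of Example 13.24 can be checked mechanically. The sketch below compares pure strategies only, so it confirms which pure strategies are weakly dominated by another pure strategy; it does not search over dominating mixed strategies. The function names are hypothetical, and the payoff lists are read off from the game of Example 13.18.

```python
def weakly_dominates(payoffs, k, h):
    """True if pure strategy k weakly dominates pure strategy h.

    payoffs[s] lists the player's own payoffs from pure strategy s against
    each pure strategy of the opponent.
    """
    pairs = list(zip(payoffs[k], payoffs[h]))
    return all(x >= y for x, y in pairs) and any(x > y for x, y in pairs)

def dominated_pure(payoffs):
    """Pure strategies that are weakly dominated by some other pure strategy."""
    return sorted({h for h in payoffs for k in payoffs
                   if k != h and weakly_dominates(payoffs, k, h)})

# Player 1's payoffs per row (against L, C, R) and player 2's per column (against U, M, D).
rows = {"U": [1, 0, 2], "M": [1, 1, 1], "D": [0, 1, 1]}
cols = {"L": [1, 2, 0], "C": [0, 2, 1], "R": [0, 1, 1]}
dominated_rows = dominated_pure(rows)
dominated_cols = dominated_pure(cols)
```

Only D and R show up, in agreement with the example.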

The  following  example  shows  an  advantage  but  at  the  same  time  a  drawback  of 
perfect  equilibrium:  a  perfect  equilibrium  may  be  payoff-dominated  by  another 
Nash  equilibrium. 

Example  13.25  Consider  the  bimatrix  game 

\[
\begin{array}{c|cc}
 & L & R \\ \hline
U & 1,1 & 10,0 \\
D & 0,10 & 10,10
\end{array} ,
\]

which has two Nash equilibria, both pure, namely $(U, L)$ and $(D, R)$. Only $(U, L)$
is perfect, as can be seen by direct inspection or by applying Theorems 13.21
and 13.23. At the equilibrium $(D, R)$, each player has an incentive to deviate to
the other pure strategy since the opponent may deviate by mistake. This equilibrium
is excluded by perfection. On the other hand, the unique perfect equilibrium $(U, L)$
is payoff-dominated by the equilibrium $(D, R)$. □

Another  drawback  of  perfect  equilibrium  is  the  fact  that  adding  strictly  dominated 
strategies  may  result  in  adding  perfect  Nash  equilibria,  as  the  following  example 
shows. 


Example  13.26  In  the  game 


there  are  two  perfect  Nash  equilibria,  namely  ( U,L )  and  (M,  C).  If  we  reduce  the 
game  by  deleting  the  strictly  dominated  pure  strategies  D  and  R ,  the  only  perfect 
equilibrium  that  remains  is  ( U ,  L).  □ 

This motivated the introduction of a further refinement called proper Nash equilibrium.
See the next section.

13.5 Proper Equilibrium

A  perfect  equilibrium  is  required  to  be  robust  only  against  some  ‘trembles’  and, 
moreover,  there  are  no  further  conditions  on  these  trembles.  We  now  propose  the 
additional  restriction  that  trembles  be  less  probable  if  they  are  more  ‘costly’. 

Given some $\varepsilon > 0$, call a strategy combination $\sigma$ in the game
$G = (N, S_1, \ldots, S_n, u_1, \ldots, u_n)$ an $\varepsilon$-proper equilibrium if $\sigma$ is
completely mixed and for all $i \in N$ and $h, k \in S_i$ we have

$$
u_i(h, \sigma_{-i}) < u_i(k, \sigma_{-i}) \;\Rightarrow\; \sigma_i(h) \le \varepsilon\, \sigma_i(k) .
$$

Observe that an $\varepsilon$-proper equilibrium does not have to be a Nash equilibrium.

Definition 13.27 A strategy combination $\sigma$ in $G$ is proper if, for some sequence
$\varepsilon^t \to 0$, $t \in \mathbb{N}$, there exist $\varepsilon^t$-proper equilibria $\sigma(\varepsilon^t)$ such that
$\sigma(\varepsilon^t) \to \sigma$. □

Since, in a proper strategy combination $\sigma$, a pure strategy $h$ of a player $i$ that is not
a best reply to $\sigma_{-i}$ is played with probability 0, it follows that a proper strategy
combination is a Nash equilibrium. Moreover, since it is straightforward by the
definitions that an $\varepsilon$-proper equilibrium is also an $\varepsilon$-perfect equilibrium, it follows
from Theorem 13.20 that a proper equilibrium is perfect. Hence, properness is a
refinement of perfection.

Remark 13.28 By replacing the word ‘proper’ by ‘perfect’ in Definition 13.27,
we obtain an alternative definition of perfect equilibrium. This follows from
Theorem 13.20. □

Example 13.26 shows that properness is a strict refinement of perfection: the Nash
equilibrium (M, C) is perfect but not proper. To see this, for $\varepsilon^t > 0$ with $\varepsilon^t \to 0$ as
$t \to \infty$, let $(p(\varepsilon^t), q(\varepsilon^t))$ be a converging sequence of $\varepsilon^t$-proper equilibria. Then for
each $t$ we must have $q_3(\varepsilon^t) \le \varepsilon^t q_1(\varepsilon^t)$. In turn, this implies that U is the unique best
reply to $q(\varepsilon^t)$, hence $p_2(\varepsilon^t) \le \varepsilon^t p_1(\varepsilon^t)$ for each $t$. But then $p(\varepsilon^t)$ cannot converge to
(0, 1, 0).

A  proper  Nash  equilibrium  always  exists: 

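Theorem 13.29 Let $G = (N, S_1, \ldots, S_n, u_1, \ldots, u_n)$ be a finite game. Then $G$ has a
proper Nash equilibrium.

Proof It is sufficient to show that for $\varepsilon > 0$ close to 0 there exists an $\varepsilon$-proper
equilibrium of $G$. Let $0 < \varepsilon < 1$ and define the error function $\mu$ by $\mu_{ik} = \varepsilon^{|S_i|} / |S_i|$
for all $i \in N$ and $k \in S_i$. For every $i \in N$ and $\sigma \in \prod_{j \in N} \Delta(S_j, \mu)$ define

$$
F_i(\sigma) = \{\tau_i \in \Delta(S_i, \mu) \mid \forall k, l \in S_i \;[\, u_i(k, \sigma_{-i}) < u_i(l, \sigma_{-i}) \Rightarrow \tau_i(k) \le \varepsilon\, \tau_i(l) \,]\} .
$$

Then $F_i(\sigma) \ne \emptyset$, as can be seen as follows. Define

$$
v_i(\sigma, k) = |\{ l \in S_i \mid u_i(k, \sigma_{-i}) < u_i(l, \sigma_{-i}) \}| ,
$$

and define $\tau_i(k) = \varepsilon^{v_i(\sigma,k)} / \sum_{l \in S_i} \varepsilon^{v_i(\sigma,l)}$ for all $k \in S_i$. Then $\tau_i \in F_i(\sigma)$. Consider
the correspondence

$$
F : \prod_{j \in N} \Delta(S_j, \mu) \to \prod_{j \in N} \Delta(S_j, \mu), \qquad \sigma \mapsto \prod_{i \in N} F_i(\sigma) .
$$

Then $F$ satisfies the conditions of the Kakutani Fixed Point Theorem (Theorem 22.11);
see Problem 13.16. Hence, $F$ has a fixed point, and each fixed point of
$F$ is an $\varepsilon$-proper equilibrium of $G$. ■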
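
The construction of $\tau_i$ in the proof is easy to try out numerically. The Python sketch below is an illustration only; the function names and the sample payoff vector are assumptions, not taken from the text. Given the payoffs $u_i(k, \sigma_{-i})$ of the pure strategies of one player, it counts for each $k$ the number $v_i(\sigma, k)$ of strictly better pure strategies, forms $\tau_i(k) = \varepsilon^{v_i(\sigma,k)} / \sum_l \varepsilon^{v_i(\sigma,l)}$, and checks both the $\varepsilon$-proper inequalities and the lower bound $\varepsilon^{|S_i|}/|S_i|$.

```python
from fractions import Fraction


def proper_reply(payoffs, eps):
    """tau(k) = eps**v_k / sum_l eps**v_l, where v_k counts strictly better strategies."""
    v = [sum(1 for other in payoffs if other > u) for u in payoffs]
    weights = [eps ** vk for vk in v]
    total = sum(weights)
    return [w / total for w in weights]


def satisfies_proper_condition(payoffs, tau, eps):
    """Check: payoff(k) < payoff(l) implies tau(k) <= eps * tau(l)."""
    n = range(len(payoffs))
    return all(tau[k] <= eps * tau[l]
               for k in n for l in n if payoffs[k] < payoffs[l])


eps = Fraction(1, 10)
payoffs = [Fraction(3), Fraction(1), Fraction(3), Fraction(0)]  # hypothetical values
tau = proper_reply(payoffs, eps)
lower_bound = eps ** len(payoffs) / len(payoffs)  # the error mu_ik from the proof
```

Exact rational arithmetic is used so that the boundary cases, where $\tau_i(k) = \varepsilon\, \tau_i(l)$ holds with equality, are not disturbed by rounding.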


In spite of the original motivation for introducing properness, this concept suffers
from the same deficit as perfect equilibrium: adding strictly dominated strategies
may enlarge the set of proper Nash equilibria. See Problem 13.17 for an example of
this.

Example 13.30 We consider again the game $G$ from Example 13.18:

$$
G = \begin{array}{c|ccc}
 & L & C & R \\ \hline
U & 1, 1 & 0, 0 & 2, 0 \\
M & 1, 2 & 1, 2 & 1, 1 \\
D & 0, 0 & 1, 1 & 1, 1
\end{array} \; .
$$

The perfect equilibria are:

(a) $\{((p, 1-p, 0), (1, 0, 0)) \mid 0 \le p \le 1\}$,

(b) $\{((0, 1, 0), (q, 1-q, 0)) \mid 0 \le q \le 1\}$.

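First consider the equilibria in (a). Which ones are proper? In any $\varepsilon$-proper
equilibrium $(p^\varepsilon, q^\varepsilon)$, player 1 plays a completely mixed strategy. This implies that
C gives player 2 a higher payoff than R, so that $q^\varepsilon_3 \le \varepsilon q^\varepsilon_2$. For player 1, the payoff
of U is $q^\varepsilon_1 + 2q^\varepsilon_3$ and the payoff of M is $q^\varepsilon_1 + q^\varepsilon_2 + q^\varepsilon_3 > q^\varepsilon_1 + 2q^\varepsilon_3$ for $\varepsilon$ small, so
that we obtain $p^\varepsilon_1 \le \varepsilon p^\varepsilon_2$. Hence, for $p > 0$ the equilibria in (a) are not proper. The
equilibrium ((0, 1, 0), (1, 0, 0)) is proper: it is the limit of the $\varepsilon$-proper equilibria
$((\varepsilon, 1, \varepsilon^2)/q, (1, \varepsilon, \varepsilon^2)/q)$, where $q = 1 + \varepsilon + \varepsilon^2$.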

Next consider the equilibria in (b). As before, in any $\varepsilon$-proper equilibrium $(p^\varepsilon, q^\varepsilon)$
we must have $q^\varepsilon_3 \le \varepsilon q^\varepsilon_2$. This implies that for player 1, the payoff from U is higher
than the payoff from D, so that we must have $p^\varepsilon_3 \le \varepsilon p^\varepsilon_1$. In turn, this implies that
for player 2 the payoff from L is higher than the payoff from C, so that $q^\varepsilon_2 \le \varepsilon q^\varepsilon_1$.
Thus, the only candidate in (b) for a proper equilibrium is ((0, 1, 0), (1, 0, 0)), and
we have already established that this combination is proper indeed. Hence, it is the
unique proper equilibrium of G. □
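
The claim that $((\varepsilon, 1, \varepsilon^2)/q, (1, \varepsilon, \varepsilon^2)/q)$ with $q = 1 + \varepsilon + \varepsilon^2$ is $\varepsilon$-proper can be verified directly from the definition. The following Python sketch does this for a few values of $\varepsilon$; it is an illustration under the assumption that the payoff matrices below reproduce the game $G$ of Example 13.18, and the function names are hypothetical.

```python
from fractions import Fraction

# Game G of Example 13.18: rows U, M, D (player 1), columns L, C, R (player 2).
A = [[1, 0, 2], [1, 1, 1], [0, 1, 1]]   # payoffs of player 1
B = [[1, 0, 0], [2, 2, 1], [0, 1, 1]]   # payoffs of player 2


def is_eps_proper(p, q, eps):
    """Check the eps-proper conditions for the strategy pair (p, q) in (A, B)."""
    if min(p) <= 0 or min(q) <= 0:          # must be completely mixed
        return False
    idx = range(3)
    u1 = [sum(A[i][j] * q[j] for j in idx) for i in idx]  # payoffs of rows
    u2 = [sum(B[i][j] * p[i] for i in idx) for j in idx]  # payoffs of columns
    ok1 = all(p[h] <= eps * p[k] for h in idx for k in idx if u1[h] < u1[k])
    ok2 = all(q[h] <= eps * q[k] for h in idx for k in idx if u2[h] < u2[k])
    return ok1 and ok2


def candidate(eps):
    """The pair ((eps, 1, eps^2)/q, (1, eps, eps^2)/q) from the example."""
    norm = 1 + eps + eps ** 2
    p = [eps / norm, 1 / norm, eps ** 2 / norm]
    q = [1 / norm, eps / norm, eps ** 2 / norm]
    return p, q


checks = [is_eps_proper(*candidate(Fraction(1, n)), Fraction(1, n))
          for n in (2, 10, 100)]

# A profile that keeps weight 1/2 on U violates p_U <= eps * p_M.
bad_p = [Fraction(1, 2), Fraction(49, 100), Fraction(1, 100)]
bad = is_eps_proper(bad_p, candidate(Fraction(1, 10))[1], Fraction(1, 10))
```

All entries of `checks` are `True`, whereas `bad` is `False`, in line with the argument that equilibria in (a) with $p > 0$ cannot be approximated by $\varepsilon$-proper equilibria.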


13.6 Strictly Perfect Equilibrium

Another refinement of perfect equilibrium is obtained by requiring robustness of a
Nash equilibrium with respect to all ‘trembles’. This results in the concept of strictly
perfect equilibrium.

Definition 13.31 A strategy combination $\sigma$ in the game $G = (N, S_1, \ldots, S_n,
u_1, \ldots, u_n)$ is a strictly perfect equilibrium if, for every sequence $\{G(\mu^t)\}$, $t \in \mathbb{N}$, of
perturbed games with $\mu^t \to 0$ for $t \to \infty$, there exist profiles $\sigma^t \in NE(G(\mu^t))$ such
that $\sigma^t \to \sigma$.

Clearly,  a  strictly  perfect  equilibrium  is  a  perfect  equilibrium.  For  some  further 
observations  concerning  strictly  perfect  equilibrium  see  Problem  13.18. 

A clear drawback of strictly perfect equilibrium is the fact that it does not have
to exist, as the following (continued) example shows.

Example 13.32 We consider again the game $G$ from Example 13.18:

$$
G = \begin{array}{c|ccc}
 & L & C & R \\ \hline
U & 1, 1 & 0, 0 & 2, 0 \\
M & 1, 2 & 1, 2 & 1, 1 \\
D & 0, 0 & 1, 1 & 1, 1
\end{array} \; ,
$$

with perfect equilibria:

(a) $\{((p, 1-p, 0), (1, 0, 0)) \mid 0 \le p \le 1\}$,

(b) $\{((0, 1, 0), (q, 1-q, 0)) \mid 0 \le q \le 1\}$.

Consider the equilibria in (a) and suppose $p > 0$. Let $G(\mu)$ be the perturbed game as
in Sect. 13.4. We may assume that, since $p > 0$, L must yield player 2 a higher payoff
than C in a Nash equilibrium of $G(\mu)$. In turn this implies that player 2's equilibrium
strategy in any Nash equilibrium of $G(\mu)$ is equal to $(1 - \mu_{22} - \mu_{23}, \mu_{22}, \mu_{23})$ (we
already saw that player 2 must always put minimal probability on R). Now choose
$\mu_{23} < \mu_{22}$, then the payoff $1 - \mu_{22} + \mu_{23}$ to player 1 from playing U is smaller than
the payoff 1 from M. This implies that player 1 plays M in the limit. Thus, a perfect
equilibrium of the form $((p, 1-p, 0), (1, 0, 0))$ with $p > 0$ is not strictly perfect.

Next, consider the equilibrium ((0, 1, 0), (1, 0, 0)), which is both in (a) and in (b).
Assume again that $\mu_{23} < \mu_{22}$. If player 2 plays $(q_1, q_2, q_3)$ in $G(\mu)$, then $q_3 = \mu_{23}$
and the payoff to player 1 from U is $q_1 + 2\mu_{23}$, whereas the payoff from M is
$q_1 + q_2 + \mu_{23}$. Since $q_2 \ge \mu_{22} > \mu_{23}$, this implies that M yields player 1 a higher
payoff than U. In turn, this implies that player 1's strategy in any Nash equilibrium
of $G(\mu)$ is equal to $(\mu_{11}, 1 - \mu_{11} - \mu_{13}, \mu_{13})$ (recall that player 1 puts minimal
probability on D). Now suppose that $\mu_{11} < \mu_{13}$, then the payoff to player 2 from
C is higher than the payoff from L, so that player 2 must play C in the limit. Thus,
also the perfect equilibrium ((0, 1, 0), (1, 0, 0)) is not strictly perfect.

For the equilibria in (b) with $q < 1$, the argument is almost identical to the one
in the preceding paragraph. In the end, choose $\mu_{11} > \mu_{13}$, then the payoff to player
2 from L is higher than the payoff from C, so that player 2 must play L in the limit.
Hence, no equilibrium of the form $((0, 1, 0), (q, 1-q, 0))$ with $q < 1$ is strictly
perfect. We conclude that this game has no strictly perfect equilibria. □
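
The first step of the argument can be illustrated with a small computation. The Python sketch below is not from the text; the names and the particular error sequence are assumptions. It takes minimal probabilities with $\mu_{23} < \mu_{22}$, lets player 2 play $(1 - \mu_{22} - \mu_{23}, \mu_{22}, \mu_{23})$, and compares player 1's payoffs from U and M in the game $G$ of Example 13.18.

```python
from fractions import Fraction


def payoff_U(q):
    """Player 1's payoff from U against q = (q_L, q_C, q_R): row (1, 0, 2)."""
    return q[0] + 2 * q[2]


def payoff_M(q):
    """Player 1's payoff from M against q: row (1, 1, 1)."""
    return q[0] + q[1] + q[2]


def player2_strategy(mu22, mu23):
    """Player 2 puts only the minimal probabilities on C and R."""
    return (1 - mu22 - mu23, mu22, mu23)


results = []
for t in range(1, 6):
    mu22 = Fraction(1, 10 ** t)         # assumed error on C
    mu23 = Fraction(1, 10 ** (t + 1))   # assumed smaller error on R
    q = player2_strategy(mu22, mu23)
    results.append((payoff_U(q), payoff_M(q)))
```

For every pair in `results` the payoff from U equals $1 - \mu_{22} + \mu_{23} < 1$, so M is strictly better along this sequence of perturbed games, however small the errors become.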


13.7 Correlated Equilibrium

In  the  preceding  sections  we  studied  several  refinements  of  Nash  equilibrium.  In 
this  section  the  set  of  Nash  equilibria  is  extended  in  a  way  to  become  clear  below. 
It  is,  however,  not  the  intention  to  enlarge  the  set  of  Nash  equilibria  but  rather  to 
enable  the  players  to  reach  better  payoffs  by  allowing  some  communication  device. 
This  will  result  in  the  concept  of  correlated  equilibrium.  Attention  in  this  section  is 
restricted to bimatrix games.

In order to fix ideas, consider the situation where two car drivers approach a
road crossing. Each driver has two pure strategies: ‘stop’ (s) or ‘cross’ (c). The
preferences for the resulting combinations are as expressed by the following table:

$$
(A, B) = \begin{array}{c|cc}
 & c & s \\ \hline
c & -10, -10 & 5, 0 \\
s & 0, 5 & -1, -1
\end{array} \; .
$$

This bimatrix game has two asymmetric and seemingly unfair pure Nash equilibria,
and one symmetric mixed Nash equilibrium ((3/8, 5/8), (3/8, 5/8)), resulting in an
expected payoff of $-5/8$ for both, and therefore also not quite satisfying.

Now  suppose  that  traffic  lights  are  installed  that  indicate  c  (‘green’)  or  s  (‘red’) 
according  to  the  probabilities  in  the  following  table: 


$$
\begin{array}{c|cc}
 & c & s \\ \hline
c & 0.00 & 0.55 \\
s & 0.40 & 0.05
\end{array} \; .
$$

For  example,  with  probability  0.55  (55  %  of  the  time)  the  light  is  green  for  driver 
1  and  red  for  driver  2.  Assume  that  the  players  (drivers)  are  not  forced  to  obey  the 
traffic  lights  but  know  the  probabilities  as  given  in  the  table.  We  argue  that  it  is  in 
each  player’s  own  interest  to  obey  the  lights  if  the  other  player  does  so. 

If the light is green for player 1 then player 1 knows with certainty that the light
is red for player 2. So if player 2 obeys the lights and stops, it is indeed optimal
for player 1 to cross. If the light is red for player 1, then the conditional probability
that player 2 crosses (if he obeys the lights) is equal to $0.4/0.45 \approx 0.89$ and the
conditional probability that player 2 stops is $0.05/0.45 \approx 0.11$. So if player 1 stops,
his expected payoff is $0.89 \cdot 0 + 0.11 \cdot (-1) = -0.11$, and if he crosses his expected
payoff is $0.89 \cdot (-10) + 0.11 \cdot 5 = -8.35$. Clearly, it is optimal for player 1 to obey
the light and stop.

For player 2 the argument is similar. If the light is green for player 2 then he
knows with certainty that player 1 has red light. Thus, if player 1 obeys the red light
and stops, it is optimal for player 2 to cross. If player 2 has red light then he knows
that the conditional probabilities are $0.55/0.60 \approx 0.92$ and $0.05/0.60 \approx 0.08$ that
player 1 has green light and red light, respectively. For player 2, assuming that player
1 obeys the traffic lights, stopping yields an expected payoff of $0.08 \cdot (-1)$ and crossing
an expected payoff of $0.92 \cdot (-10) + 0.08 \cdot 5$: clearly, stopping is optimal.

Thus, we can indeed talk of an equilibrium: such an equilibrium is called a
correlated equilibrium. Note that there is no mixed strategy combination in the
game (A, B) that induces these probabilities. If $p = (p_1, p_2)$ and $q = (q_1, q_2)$
are mixed strategies of players 1 and 2, respectively, then we would need $p_1 \ne 0$
since $p_1 q_2 = 0.55$ and we would need $q_1 \ne 0$ since $p_2 q_1 = 0.40$, but then
$p_1 q_1 \ne 0$, hence the combination (c, c) would have positive probability. In terms of
the situation in the example, this particular equilibrium cannot be reached without
traffic lights serving as a communication device between the players. The overall
expected payoffs of the players are $0.55 \cdot 5 + 0.05 \cdot (-1) = 2.7$ for player 1 and
$0.40 \cdot 5 + 0.05 \cdot (-1) = 1.95$ for player 2, which is considerably better for both than
the payoffs in the mixed Nash equilibrium.
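
The conditional payoffs in the traffic-light example can be recomputed exactly. The Python sketch below is an illustration with hypothetical function names; it uses the payoff table and the signal probabilities of the example and works with exact fractions, so its values differ slightly from the rounded figures above (for instance $-25/3 \approx -8.33$ instead of $-8.35$).

```python
from fractions import Fraction as F

A = [[-10, 5], [0, -1]]    # driver 1: rows c, s
B = [[-10, 0], [5, -1]]    # driver 2: columns c, s
P = [[F(0), F(55, 100)],   # probabilities of the signal pairs (row, column)
     [F(40, 100), F(5, 100)]]


def conditional_payoffs_player1(signal):
    """Payoffs of player 1's actions (c, s) given his signal, if player 2 obeys."""
    total = sum(P[signal])
    return [sum(P[signal][j] * A[action][j] for j in range(2)) / total
            for action in range(2)]


def conditional_payoffs_player2(signal):
    """Payoffs of player 2's actions (c, s) given his signal, if player 1 obeys."""
    total = P[0][signal] + P[1][signal]
    return [sum(P[i][signal] * B[i][action] for i in range(2)) / total
            for action in range(2)]


overall1 = sum(P[i][j] * A[i][j] for i in range(2) for j in range(2))
overall2 = sum(P[i][j] * B[i][j] for i in range(2) for j in range(2))
```

With index 0 for c and 1 for s, `conditional_payoffs_player1(1)` gives the payoffs of crossing and stopping after a red light; obeying the light is the better action in each of the four cases.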

In general, let (A, B) be an $m \times n$ bimatrix game. A correlated strategy is an
$m \times n$ matrix $P = (p_{ij})$ with $\sum_{i=1}^{m} \sum_{j=1}^{n} p_{ij} = 1$ and $p_{ij} \ge 0$ for all $i = 1, \ldots, m$,
$j = 1, \ldots, n$. A correlated strategy $P$ can be thought of as a communication device:
the pair $(i, j)$ is chosen with probability $p_{ij}$, and if that happens, player 1 receives
the signal $i$ and player 2 the signal $j$. Suppose player 2 obeys the signal. If player 1
receives signal $i$ and indeed plays $i$, his expected payoff is

$$
\sum_{j=1}^{n} p_{ij} a_{ij} \Big/ \sum_{j=1}^{n} p_{ij} ,
$$

and if he plays row $k$ instead, his expected payoff is

$$
\sum_{j=1}^{n} p_{ij} a_{kj} \Big/ \sum_{j=1}^{n} p_{ij} .
$$

So to keep player 1 from disobeying the received signal, we should have

$$
\sum_{j=1}^{n} (a_{ij} - a_{kj})\, p_{ij} \ge 0 \quad \text{for all } i, k = 1, \ldots, m . \tag{13.5}
$$

The analogous condition for player 2 is:

$$
\sum_{i=1}^{m} (b_{ij} - b_{il})\, p_{ij} \ge 0 \quad \text{for all } j, l = 1, \ldots, n . \tag{13.6}
$$

Definition 13.33 A correlated equilibrium in the bimatrix game (A, B) is a correlated
strategy $P = (p_{ij})$ satisfying (13.5) and (13.6). □
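
Conditions (13.5) and (13.6) are finitely many linear inequalities, so they can be checked directly for a given matrix $P$. The following Python sketch is a minimal illustration; the function name is hypothetical. It is applied to the two-driver game with the traffic-light probabilities, and to the uniform distribution as a counterexample.

```python
from fractions import Fraction as F


def is_correlated_equilibrium(A, B, P):
    """Check that P is a probability matrix satisfying (13.5) and (13.6)."""
    m, n = len(A), len(A[0])
    if sum(map(sum, P)) != 1 or any(x < 0 for row in P for x in row):
        return False
    # (13.5): player 1 does not gain by playing row k after signal i.
    cond_13_5 = all(
        sum((A[i][j] - A[k][j]) * P[i][j] for j in range(n)) >= 0
        for i in range(m) for k in range(m))
    # (13.6): player 2 does not gain by playing column l after signal j.
    cond_13_6 = all(
        sum((B[i][j] - B[i][l]) * P[i][j] for i in range(m)) >= 0
        for j in range(n) for l in range(n))
    return cond_13_5 and cond_13_6


A = [[-10, 5], [0, -1]]    # two-driver example, rows and columns ordered c, s
B = [[-10, 0], [5, -1]]
lights = [[F(0), F(55, 100)], [F(40, 100), F(5, 100)]]
uniform = [[F(1, 4), F(1, 4)], [F(1, 4), F(1, 4)]]

lights_ok = is_correlated_equilibrium(A, B, lights)
uniform_ok = is_correlated_equilibrium(A, B, uniform)
```

The traffic-light matrix passes the test, whereas the uniform distribution fails: after the signal c, player 1 would rather stop.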

For  the  two-driver  example  conditions  (13.5)  and  (13.6)  result  in  four  inequalities, 
which  are  not  difficult  to  solve  (Problem  13.19).  In  general,  any  Nash  equilibrium 
of  a  bimatrix  game  results  in  a  correlated  equilibrium  (Problem  13.20),  so  existence 
of  a  correlated  equilibrium  is  not  an  issue. 

The set of correlated equilibria is convex (Problem 13.21), so the convex hull of
all payoff pairs corresponding to the Nash equilibria of a bimatrix game consists of
payoff pairs attainable in correlated equilibria. Problem 13.22 presents an example
of a game in which some payoff pairs can be reached in correlated equilibria but not
as convex combinations of payoff pairs of Nash equilibria.

In general, correlated equilibria can be computed using linear programming.
Specifically, let (A, B) be an $m \times n$ bimatrix game. We associate with (A, B) an
$mn \times (m(m-1) + n(n-1))$ matrix $C$ as follows. For each pair $(i, j)$ of a row and a
column in (A, B) we have a row in $C$, and for each pair $(h, k)$ of two different rows
in (A, B) or two different columns in (A, B) we have a column in $C$. We define

$$
c_{(i,j),(h,k)} = \begin{cases}
a_{ij} - a_{kj} & \text{if } i = h \in \{1, \ldots, m\} \text{ and } k \in \{1, \ldots, m\}, \\
b_{ij} - b_{ik} & \text{if } j = h \in \{1, \ldots, n\} \text{ and } k \in \{1, \ldots, n\}, \\
0 & \text{otherwise.}
\end{cases}
$$

Let $P = (p_{ij})$ be a correlated strategy in (A, B). Then $P$ can be seen as a vector
$p \in \mathbb{R}^{mn}$. Let $c_{(h,k)}$ be a column in $C$. If $h$ and $k$ are rows of (A, B) we have

$$
p \cdot c_{(h,k)} = \sum_{j=1}^{n} p_{hj} (a_{hj} - a_{kj})
$$

and if $h$ and $k$ are columns of (A, B) we have

$$
p \cdot c_{(h,k)} = \sum_{i=1}^{m} p_{ih} (b_{ih} - b_{ik}) .
$$

Hence, by (13.5) and (13.6), $P$ is a correlated equilibrium of (A, B) if and only if
$pC \ge 0$. If we consider $C$ as a matrix game and $p$ as a strategy of player 1 in
$C$ (not to be confused with player 1 in (A, B)), then the existence of a correlated
equilibrium implies that $v(C)$, the value of the matrix game $C$, is nonnegative. In
particular, this implies that any optimal strategy of player 1 in $C$ is a correlated
equilibrium in (A, B). If $v(C) = 0$ then any correlated equilibrium in (A, B) is an
optimal strategy of player 1 in $C$, but if $v(C) > 0$ then there may be correlated
equilibria in (A, B) that are not optimal strategies for player 1 in $C$: they may only
guarantee zero. The latter is the case in the two-drivers example (Problem 13.23).

Matrix  games  can  be  solved  by  linear  programming,  see  Sect.  12.2.  We  conclude 
with  an  example  in  which  the  described  technique  is  applied. 

Example 13.34 Consider the bimatrix game

$$
(A, B) = \begin{array}{c|ccc}
 & 1' & 2' & 3' \\ \hline
1 & 3, 1 & 2, 5 & 6, 0 \\
2 & 1, 4 & 3, 3 & 2, 6
\end{array} \; .
$$

The associated matrix game $C$ is as follows.

$$
\begin{array}{c|rrrrrrrr}
 & (1,2) & (2,1) & (1',2') & (1',3') & (2',1') & (2',3') & (3',1') & (3',2') \\ \hline
(1,1') & 2 & 0 & -4 & 1 & 0 & 0 & 0 & 0 \\
(1,2') & -1 & 0 & 0 & 0 & 4 & 5 & 0 & 0 \\
(1,3') & 4 & 0 & 0 & 0 & 0 & 0 & -1 & -5 \\
(2,1') & 0 & -2 & 1 & -2 & 0 & 0 & 0 & 0 \\
(2,2') & 0 & 1 & 0 & 0 & -1 & -3 & 0 & 0 \\
(2,3') & 0 & -4 & 0 & 0 & 0 & 0 & 2 & 3
\end{array}
$$

It can be checked that this game has value 0, by using the optimal strategies

$$
P = \begin{array}{c|ccc}
 & 1' & 2' & 3' \\ \hline
1 & 0 & 0.3 & 0.075 \\
2 & 0 & 0.5 & 0.125
\end{array}
$$

for player 1 in $C$ and $(0, 0, 1/2, 1/2, 0, 0, 0, 0)$ for player 2 in $C$. The optimal
strategy for player 1 in $C$ is unique, and since $v(C) = 0$ this implies that the game
(A, B) has a unique correlated equilibrium. Consequently, this must correspond to
the unique Nash equilibrium of the game. Indeed, ((3/8, 5/8), (0, 4/5, 1/5)) is the
unique Nash equilibrium of (A, B) and it results in the probabilities given by $P$. □
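
The matrix $C$ of Example 13.34 and the value claim can be reproduced with a few lines of code. The Python sketch below is an illustration only; `associated_matrix` is a hypothetical helper that follows the definition of $C$ in this section, with rows ordered $(1,1'), \ldots, (2,3')$ and columns ordered as in the example. It then checks that $pC \ge 0$ for the stated strategy of player 1 and that player 2's stated strategy holds every row to at most 0, which together give $v(C) = 0$.

```python
from fractions import Fraction as F


def associated_matrix(A, B):
    """Build the mn x (m(m-1) + n(n-1)) matrix C associated with (A, B)."""
    m, n = len(A), len(A[0])
    row_pairs = [(h, k) for h in range(m) for k in range(m) if h != k]
    col_pairs = [(h, k) for h in range(n) for k in range(n) if h != k]
    C = []
    for i in range(m):
        for j in range(n):
            part_rows = [A[i][j] - A[k][j] if i == h else 0 for h, k in row_pairs]
            part_cols = [B[i][j] - B[i][k] if j == h else 0 for h, k in col_pairs]
            C.append(part_rows + part_cols)
    return C


A = [[3, 2, 6], [1, 3, 2]]
B = [[1, 5, 0], [4, 3, 6]]
C = associated_matrix(A, B)

p = [F(0), F(3, 10), F(3, 40), F(0), F(1, 2), F(1, 8)]   # P written as a vector
y = [F(0), F(0), F(1, 2), F(1, 2), F(0), F(0), F(0), F(0)]

pC = [sum(p[r] * C[r][c] for r in range(len(C))) for c in range(len(C[0]))]
Cy = [sum(C[r][c] * y[c] for c in range(len(C[0]))) for r in range(len(C))]
```

Since `min(pC)` and `max(Cy)` are both 0, player 1 guarantees at least 0 and player 2 concedes at most 0. Uniqueness of the optimal strategy is not checked by this sketch.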


13.8 A Characterization of Nash Equilibrium

The  concept  of  Nash  equilibrium  requires  strong  behavioral  assumptions  about  the 
players.  Each  player  should  be  able  to  guess  what  other  players  will  do,  assume  that 
other  players  know  this  and  make  similar  conjectures,  and  so  on,  and  all  this  should 
be  in  equilibrium.  The  basic  difficulty  is  that  Nash  equilibrium  is  a  circular  concept: 
a  player  plays  a  best  reply  against  the  conjectured  strategies  of  the  opponents  but, 
in  turn,  this  best  reply  should  be  conjectured  by  the  opponents  and  they  should  play 
best  replies  as  well.  Not  surprisingly,  theories  of  repeated  play  or  learning  or,  more 
generally,  dynamic  models  that  aim  to  explain  how  players  in  a  game  come  to  play 
a  Nash  equilibrium,  have  in  common  that  they  change  the  strategic  decision  into  a 
collection  of  single-player  decision  problems. 

In  this  section  we  review  a  different  approach,  which  is  axiomatic  in  nature.  The 
Nash  equilibrium  concept  is  viewed  as  a  solution  concept:  a  correspondence  which 
assigns  to  any  finite  game  a  set  of  strategy  combinations.  One  of  the  conditions 
(axioms)  put  on  this  correspondence  is  a  condition  of  consistency  with  respect  to 
changes  in  the  number  of  players:  if  a  player  leaves  the  game,  leaving  his  strategy 
as  an  input  behind,  then  the  other  players  should  not  want  to  change  their  strategies. 
This  is  certainly  true  for  Nash  equilibrium,  and  it  can  be  imposed  as  a  condition  on  a 
solution correspondence. By assuming that players in single-player games (hence,
in ‘simple’ maximization problems) behave rationally, and by adding a converse
consistency condition, it follows that the solution correspondence must be the Nash
equilibrium correspondence. We proceed with a formal treatment of this axiomatic
characterization.

Let $\Gamma$ be a collection of finite games of the form $G = (N, S_1, \ldots, S_n, u_1, \ldots, u_n)$.
(It is implicit that also the set of players $N$ may vary in $\Gamma$.) A solution on $\Gamma$ is a
function $\varphi$ that assigns to each $G \in \Gamma$ a set of strategy combinations $\varphi(G) \subseteq
\prod_{i \in N} \Delta(S_i)$. A particular solution is the Nash correspondence NE, assigning to each
$G \in \Gamma$ the set $NE(G)$ of all Nash equilibria of $G$.

Definition 13.35 The solution $\varphi$ satisfies one-person rationality (OPR) if

$$
\varphi(G) = \{\sigma_i \in \Delta(S_i) \mid u_i(\sigma_i) \ge u_i(\tau_i) \text{ for all } \tau_i \in \Delta(S_i)\}
$$

for every one-person game $G = (\{i\}, S_i, u_i)$ in $\Gamma$. □

The  interpretation  of  OPR  is  clear  and  needs  no  further  comment. 

Let $G = (N, S_1, \ldots, S_n, u_1, \ldots, u_n)$ be a game, $\emptyset \ne M \subseteq N$, and let $\sigma$ be
a strategy combination in $G$. The reduced game of $G$ with respect to $M$ and $\sigma$
is the game $G^{M,\sigma} = (M, (S_i)_{i \in M}, (u^\sigma_i)_{i \in M})$, where $u^\sigma_i(\tau) = u_i(\tau, \sigma_{N \setminus M})$ for all
$\tau \in \prod_{j \in M} \Delta(S_j)$.² The interpretation of such a reduced game is straightforward:
if the players of $N \setminus M$ leave the game, leaving their strategy combination $\sigma_{N \setminus M}$
behind, then the remaining players are faced with the game $G^{M,\sigma}$. Alternatively, if
it is common knowledge among the players in $M$ that the players outside $M$ play
according to $\sigma$, then they are faced with the game $G^{M,\sigma}$. Call a collection of games
$\Gamma$ closed if it is closed under taking reduced games.

Definition 13.36 Let $\Gamma$ be a closed collection of games and let $\varphi$ be a solution on
$\Gamma$. Then $\varphi$ is consistent (CONS) if for every game $G = (N, S_1, \ldots, S_n, u_1, \ldots, u_n)$,
every $\emptyset \ne M \subseteq N$, and every strategy combination $\sigma \in \varphi(G)$, we have $\sigma_M \in
\varphi(G^{M,\sigma})$. □

The interpretation of consistency is as follows. If the players outside $M$ have left the
game while leaving the strategy combination $\sigma_{N \setminus M}$ behind, then there should be no
need for the remaining players to revise their strategies.

The  consequence  of  imposing  OPR  and  CONS  on  a  solution  correspondence  is 
that  it  can  contain  only  Nash  equilibria: 

Proposition 13.37 Let $\Gamma$ be a closed collection of games and let $\varphi$ be a solution
on $\Gamma$ satisfying OPR and CONS. Then $\varphi(G) \subseteq NE(G)$ for every $G \in \Gamma$.

Proof Let $G = (N, S_1, \ldots, S_n, u_1, \ldots, u_n) \in \Gamma$ and $\sigma \in \varphi(G)$. By CONS, $\sigma_i \in
\varphi(G^{\{i\},\sigma})$ for every $i \in N$. By OPR, $u^\sigma_i(\sigma_i) \ge u^\sigma_i(\tau_i)$ for every $\tau_i \in \Delta(S_i)$ and $i \in N$.
Hence

$$
u_i(\sigma_i, \sigma_{N \setminus \{i\}}) \ge u_i(\tau_i, \sigma_{N \setminus \{i\}}) \quad \text{for every } \tau_i \in \Delta(S_i) \text{ and } i \in N.
$$

Thus, $\sigma \in NE(G)$. ■

² For a subset $T \subseteq N$, we denote $(\sigma_j)_{j \in T}$ by $\sigma_T$.
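
The proof rests on the observation that a strategy combination is a Nash equilibrium exactly when each player's strategy is optimal in the one-person reduced game obtained by fixing the strategies of all other players. The Python sketch below illustrates this for a bimatrix game; the function names are hypothetical, and the game and the strategy pair are taken from Example 13.34. It uses the standard fact that a mixed strategy maximizes a linear payoff function if and only if it puts positive weight only on maximizing pure strategies.

```python
from fractions import Fraction as F


def reduced_payoffs_player1(A, q):
    """One-person reduced game of player 1: payoff of each row when q is left behind."""
    return [sum(a * qj for a, qj in zip(row, q)) for row in A]


def reduced_payoffs_player2(B, p):
    """One-person reduced game of player 2: payoff of each column when p is left behind."""
    return [sum(B[i][j] * p[i] for i in range(len(B))) for j in range(len(B[0]))]


def optimal_in_one_person_game(payoffs, strategy):
    """OPR: weight only on pure strategies with maximal payoff."""
    best = max(payoffs)
    return all(prob == 0 or u == best for prob, u in zip(strategy, payoffs))


def is_nash(A, B, p, q):
    """Nash equilibrium check through the two one-person reduced games."""
    return (optimal_in_one_person_game(reduced_payoffs_player1(A, q), p)
            and optimal_in_one_person_game(reduced_payoffs_player2(B, p), q))


A = [[3, 2, 6], [1, 3, 2]]
B = [[1, 5, 0], [4, 3, 6]]
p_star = [F(3, 8), F(5, 8)]
q_star = [F(0), F(4, 5), F(1, 5)]

equilibrium = is_nash(A, B, p_star, q_star)
not_equilibrium = is_nash(A, B, [F(1), F(0)], [F(1), F(0), F(0)])
```

The pair $((3/8, 5/8), (0, 4/5, 1/5))$ passes both one-person tests, whereas the pure pair consisting of the first row and the first column fails the test for player 2.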

Proposition 13.37 says that NE is the maximal solution (with respect to
set-inclusion) satisfying OPR and CONS. (It is trivial to see that NE satisfies these
conditions.) To derive a similar minimal set-inclusion result we use another
condition.

Let $\Gamma$ be a closed collection of games and let $\varphi$ be a solution on $\Gamma$. For a game
$G = (N, S_1, \ldots, S_n, u_1, \ldots, u_n) \in \Gamma$ with $|N| \ge 2$ denote

$$
\tilde{\varphi}(G) = \{\sigma \in \prod_{i \in N} \Delta(S_i) \mid \text{for all } \emptyset \ne M \subsetneq N,\ \sigma_M \in \varphi(G^{M,\sigma})\} .
$$

Definition 13.38 A solution $\varphi$ on a closed collection of games satisfies converse
consistency (COCONS) if for every game $G$ with at least two players, $\tilde{\varphi}(G) \subseteq
\varphi(G)$. □

Converse consistency says that strategy combinations of which the restrictions
belong to the solution in smaller reduced games, should also belong to the solution
of the game itself. Note that consistency can be defined by the converse inclusion
$\varphi(G) \subseteq \tilde{\varphi}(G)$ for every $G \in \Gamma$, which explains the expression ‘converse
consistency’. Obviously, the Nash equilibrium correspondence satisfies COCONS.

Proposition 13.39 Let $\Gamma$ be a closed collection of games and let $\varphi$ be a solution
on $\Gamma$ satisfying OPR and COCONS. Then $\varphi(G) \supseteq NE(G)$ for every $G \in \Gamma$.

Proof The proof is by induction on the number of players. For one-person games
the inclusion follows (with equality) from OPR. Assume that $NE(G) \subseteq \varphi(G)$ for
all $t$-person games in $\Gamma$, where $t \le k$ and $k \ge 1$. Let $G_0$ be a $(k+1)$-person game
in $\Gamma$. Note that $NE(G_0) \subseteq \widetilde{NE}(G_0)$ by CONS of NE. By the induction hypothesis,
$\widetilde{NE}(G_0) \subseteq \tilde{\varphi}(G_0)$ and by COCONS, $\tilde{\varphi}(G_0) \subseteq \varphi(G_0)$. Thus, $NE(G_0) \subseteq \varphi(G_0)$. ■

Corollary 13.40 Let $\Gamma$ be a closed collection of games. The Nash equilibrium
correspondence is the unique solution on $\Gamma$ satisfying OPR, CONS, and COCONS.

It  can  be  shown  that  the  axioms  in  Corollary  13.40  are  independent  (Problem  13.26). 

In general, the consistency approach fails when applied to refinements of Nash
equilibrium. For instance, Problem 13.27 shows that the correspondence of perfect
equilibria is not consistent.

13.9 Problems

13.1. Existence of Nash Equilibrium Using Brouwer

Let $G = (N, S_1, \ldots, S_n, u_1, \ldots, u_n)$ be a finite game, as defined in Sect. 13.1. Define
the function $f : \prod_{i \in N} \Delta(S_i) \to \prod_{i \in N} \Delta(S_i)$ by

$$
f_{i,s_i}(\sigma) = \frac{\sigma_i(s_i) + \max\{0,\ u_i(s_i, \sigma_{-i}) - u_i(\sigma)\}}{1 + \sum_{s_i' \in S_i} \max\{0,\ u_i(s_i', \sigma_{-i}) - u_i(\sigma)\}}
$$

for all $i \in N$ and $s_i \in S_i$, where $\sigma_{-i} = (\sigma_1, \ldots, \sigma_{i-1}, \sigma_{i+1}, \ldots, \sigma_n)$.

(a) Show that $f$ is well-defined, i.e., that $f(\sigma) \in \prod_{i \in N} \Delta(S_i)$ for every $\sigma \in \prod_{i \in N} \Delta(S_i)$.

(b) Argue that $f$ has a fixed point, i.e., there is $\sigma^* \in \prod_{i \in N} \Delta(S_i)$ with $f(\sigma^*) = \sigma^*$, by
using the Brouwer Fixed Point Theorem (Theorem 22.10).

(c) Show that $\sigma^* \in \prod_{i \in N} \Delta(S_i)$ is a fixed point of $f$ if and only if it is a Nash
equilibrium of $G$.
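
To get a feeling for the function in Problem 13.1 before proving anything, it can be evaluated for a two-player game. The Python sketch below is an illustration with hypothetical names: each pure strategy's probability is increased by its positive payoff gain over the current expected payoff, and the result is renormalized. It is restricted to bimatrix games for brevity and uses the two-driver game of Sect. 13.7 as test case. It is not a solution of the problem.

```python
from fractions import Fraction as F


def nash_map(A, B, p, q):
    """Evaluate the map of Problem 13.1 at (p, q) for the bimatrix game (A, B)."""
    p, q = [F(x) for x in p], [F(x) for x in q]
    m, n = len(A), len(A[0])
    u1 = [sum(A[i][j] * q[j] for j in range(n)) for i in range(m)]  # pure rows
    u2 = [sum(B[i][j] * p[i] for i in range(m)) for j in range(n)]  # pure columns
    v1 = sum(p[i] * u1[i] for i in range(m))    # current payoff of player 1
    v2 = sum(q[j] * u2[j] for j in range(n))    # current payoff of player 2
    gain1 = [max(F(0), u - v1) for u in u1]
    gain2 = [max(F(0), u - v2) for u in u2]
    new_p = [(p[i] + gain1[i]) / (1 + sum(gain1)) for i in range(m)]
    new_q = [(q[j] + gain2[j]) / (1 + sum(gain2)) for j in range(n)]
    return new_p, new_q


A = [[-10, 5], [0, -1]]    # two-driver game, strategies ordered c, s
B = [[-10, 0], [5, -1]]

mixed = ([F(3, 8), F(5, 8)], [F(3, 8), F(5, 8)])
image_of_mixed = nash_map(A, B, *mixed)
image_of_cc = nash_map(A, B, [1, 0], [1, 0])   # both drivers cross
```

The mixed Nash equilibrium is mapped to itself, while the combination (c, c), which is not an equilibrium, is moved towards stopping for both players.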

13.2.  Existence  of  Nash  Equilibrium  Using  Kakutani 

Prove that the correspondence $\beta$ in the proof of Theorem 13.1 is upper
semi-continuous and convex-valued. Also check that every fixed point of $\beta$ is a Nash
equilibrium of $G$.

13.3.  Lemma  13.2 
Prove  Lemma  13.2. 

13.4.  Lemma  13.3 
Prove  Lemma  13.3. 

13.5.  Dominated  Strategies 

Let (A, B) be an $m \times n$ bimatrix game. Suppose there exists a $q \in \Delta^n$ such that
$q_n = 0$ and $Bq > Be^n$ (i.e., there exists a mixture of the first $n - 1$ columns of $B$ that
is strictly better than playing the $n$-th column).

(a) Prove that $q^*_n = 0$ for every Nash equilibrium $(p^*, q^*)$.

Let $(A', B')$ be the bimatrix game obtained from (A, B) by deleting the last
column.

(b) Prove that $(p^*, q')$ is a Nash equilibrium of $(A', B')$ if and only if $(p^*, q^*)$ is a
Nash equilibrium of (A, B), where $q'$ is the strategy obtained from $q^*$ by deleting
the last coordinate.

13.6. A 3 × 3 Bimatrix Game
Consider the 3 × 3 bimatrix game

$$
(A, B) = \begin{pmatrix}
0, 4 & 4, 0 & 5, 3 \\
4, 0 & 0, 4 & 5, 3 \\
3, 5 & 3, 5 & 6, 6
\end{pmatrix} .
$$

Let $(p, q)$ be a Nash equilibrium in (A, B).

(a) Prove that $\{1, 2\} \not\subseteq C(p)$.

(b) Prove that $C(p) \ne \{2, 3\}$.

(c) Find all Nash equilibria of this game.

13.7.  A  3  x  2  Bimatrix  Game 

Use  the  graphical  method  to  compute  the  Nash  equilibria  of  the  bimatrix  game 


(A,B)=  2,2  0,2 

\  2, 2  0,2 


13.8.  The  Nash  Equilibria  in  Example  13.18 

(a)  Compute  the  Nash  equilibria  of  the  game  in  Example  13.18. 

(b) Show that the Nash equilibria in the set $\{((0, 1, 0), (q, 1-q, 0)) \mid 0 \le q \le 1\}$
are perfect by using the definition of perfection.

13.9.  Proof  of  Theorem  13.8 
Prove  Theorem  13.8. 

13.10.  Matrix  Games 

Show  that  the  pair  of  linear  programs  (13.3)  and  (13.4)  is  equivalent  to  the  LP  and 
its  dual  in  Sect.  12.2  for  solving  matrix  games. 

13.11.  Tic-Tac-Toe 

The  two-player  game  of  Tic-Tac-Toe  is  played  on  a  3  x  3  board.  Player  1  starts  by 
putting  a  cross  on  one  of  the  nine  fields.  Next,  player  2  puts  a  circle  on  one  of  the 
eight  remaining  fields.  Then  player  1  puts  a  cross  on  one  of  the  remaining  seven 
fields,  etc.  If  player  1  achieves  three  crosses  or  player  2  achieves  three  circles  in 
a row (either vertically or horizontally or diagonally) then that player wins. If this
does not happen and the board is full, then the game ends in a draw.

(a) Design a pure maximin strategy for player 1. Show that this maximin strategy
guarantees at least a draw to him.

(b)  Show  that  player  1  cannot  guarantee  a  win. 

(c)  What  is  the  value  of  this  game? 


13.12.  Iterated  Elimination  in  a  Three-Player  Game 

Solve  the  following  three-player  game,  where  player  1  chooses  rows,  player  2 
columns,  and  player  3  one  of  the  two  games  L  and  R : 

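$$
L: \begin{array}{c|cc}
 & l & r \\ \hline
U & 14, 24, 32 & 8, 30, 27 \\
D & 30, 16, 24 & 13, 12, 50
\end{array}
\qquad
R: \begin{array}{c|cc}
 & l & r \\ \hline
U & 16, 24, 30 & 30, 16, 24 \\
D & 30, 23, 14 & 14, 24, 32
\end{array} \; .
$$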

13.13.  Never  a  Best  Reply  and  Domination 

In  the  following  game  player  1  chooses  rows,  player  2  chooses  columns,  and  player 
3  chooses  matrices.  The  diagram  gives  the  payoffs  of  player  3.  Show  that  Y  is  never 
a  best  reply  for  player  3,  and  that  Y  is  not  strictly  (and  not  even  weakly)  dominated. 


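$$
V: \begin{array}{c|cc}
 & L & R \\ \hline
U & 9 & 0 \\
D & 0 & 0
\end{array}
\qquad
W: \begin{array}{c|cc}
 & L & R \\ \hline
U & 0 & 9 \\
D & 9 & 0
\end{array}
\qquad
X: \begin{array}{c|cc}
 & L & R \\ \hline
U & 0 & 0 \\
D & 0 & 9
\end{array}
\qquad
Y: \begin{array}{c|cc}
 & L & R \\ \hline
U & 6 & 0 \\
D & 0 & 6
\end{array}
$$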

13.14.  Completely  Mixed  Nash  Equilibria  Are  Perfect 
Prove  Lemma  13.16. 


13.15. A 3-Player Game with an Undominated But Not Perfect Equilibrium
Consider the following 3-player game, where player 1 chooses rows, player 2
columns, and player 3 matrices:

$$
L: \begin{array}{c|cc}
 & l & r \\ \hline
U & 1, 1, 1 & 1, 0, 1 \\
D & 1, 1, 1 & 0, 0, 1
\end{array}
\qquad
R: \begin{array}{c|cc}
 & l & r \\ \hline
U & 1, 1, 0 & 0, 0, 0 \\
D & 0, 1, 0 & 1, 0, 0
\end{array} \; .
$$

(a) Show that $(U, l, L)$ is the only perfect Nash equilibrium of this game.

(b) Show that $(D, l, L)$ is an undominated Nash equilibrium.


13.16.  Existence  of  Proper  Equilibrium 

Prove that the correspondence $F$ in the proof of Theorem 13.29 satisfies the
conditions of the Kakutani Fixed Point Theorem.

13.17. Strictly Dominated Strategies and Proper Equilibrium
Consider the 3-person game

$$
L: \begin{array}{c|cc}
 & l & r \\ \hline
U & 1, 1, 1 & 0, 0, 1 \\
D & 0, 0, 1 & 0, 0, 1
\end{array}
\qquad
R: \begin{array}{c|cc}
 & l & r \\ \hline
U & 0, 0, 0 & 0, 0, 0 \\
D & 0, 0, 0 & 1, 1, 0
\end{array} \; ,
$$

where player 1 chooses rows, player 2 chooses columns, and player 3 chooses
matrices.


(a)  First  assume  that  player  3  is  a  dummy  and  has  only  one  strategy,  namely  L. 
Compute  the  perfect  and  proper  Nash  equilibrium  or  equilibria  of  the  game. 

(b)  Now  suppose  that  player  3  has  two  pure  strategies.  Compute  the  perfect  and 
proper  Nash  equilibrium  or  equilibria  of  the  game.  Conclude  that  adding  a 
strictly  dominated  strategy  (namely,  R)  has  resulted  in  an  additional  proper 
equilibrium. 

13.18.  Strictly  Perfect  Equilibrium 

(a)  Show  that  a  completely  mixed  Nash  equilibrium  in  a  finite  game  G  is  strictly 
perfect. 

(b)  Show  that  a  strict  Nash  equilibrium  in  a  game  G  is  strictly  perfect.  (A  Nash 
equilibrium  is  strict  if  any  unilateral  deviation  of  a  player  leads  to  a  strictly 
lower  payoff  for  that  player.) 

(c) Compute all Nash equilibria, perfect equilibria, proper equilibria, and strictly
perfect equilibria in the following game, where $\alpha, \beta > 0$. (Conclude that strictly
perfect equilibria may fail to exist.)

\[
(A,B) = \begin{array}{c|ccc} & L & M & R \\ \hline U & 0,\beta & \alpha,0 & 0,0 \\ D & 0,\beta & 0,0 & \alpha,0 \end{array}
\]


13.19.  Correlated  Equilibria  in  the  Two-Driver  Example  (1) 
Compute  all  correlated  equilibria  in  the  game 


\[
(A,B) = \begin{array}{c|cc} & c & s \\ \hline c & -10,-10 & 5,0 \\ s & 0,5 & -1,-1 \end{array}
\]


by  using  the  definition  of  correlated  equilibrium. 
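
The definition can also be checked mechanically for a candidate distribution. The following Python sketch assumes the usual inequality form of the definition: for each row $i$ and alternative row $k$, $\sum_j p_{ij}(a_{ij} - a_{kj}) \geq 0$, and symmetrically for columns. The function name `is_correlated_equilibrium` and the three test distributions are illustrative choices, not part of the problem, and checking a few candidates does not replace computing the full set.

```python
from fractions import Fraction as F

# Two-driver game: index 0 = c, index 1 = s.
A = [[F(-10), F(5)], [F(0), F(-1)]]   # payoffs of player 1
B = [[F(-10), F(0)], [F(5), F(-1)]]   # payoffs of player 2

def is_correlated_equilibrium(P, A, B):
    """Check the incentive constraints for the distribution P (assumed inequality form)."""
    m, n = len(A), len(A[0])
    if sum(sum(row) for row in P) != 1 or any(p < 0 for row in P for p in row):
        return False
    # Player 1: when recommended row i, switching to row k must not pay.
    for i in range(m):
        for k in range(m):
            if sum(P[i][j] * (A[i][j] - A[k][j]) for j in range(n)) < 0:
                return False
    # Player 2: when recommended column j, switching to column l must not pay.
    for j in range(n):
        for l in range(n):
            if sum(P[i][j] * (B[i][j] - B[i][l]) for i in range(m)) < 0:
                return False
    return True

pure_nash = [[F(0), F(1)], [F(0), F(0)]]         # all mass on (c, s)
half_half = [[F(0), F(1, 2)], [F(1, 2), F(0)]]   # (c, s) and (s, c) equally likely
both_cross = [[F(1), F(0)], [F(0), F(0)]]        # all mass on (c, c)
```

Exact fractions are used so that the weak inequalities are tested without rounding error.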


13.20.  Nash  Equilibria  Are  Correlated 

Let $(p, q)$ be a Nash equilibrium in the $m \times n$ bimatrix game $(A, B)$. Let $P = (p_{ij})$ be
the $m \times n$ matrix defined by $p_{ij} = p_i q_j$ for all $i = 1, \ldots, m$ and $j = 1, \ldots, n$. Show
that $P$ is a correlated equilibrium.

13.21. The Set of Correlated Equilibria Is Convex

Show  that  the  set  of  correlated  equilibria  in  a  bimatrix  game  (A,  B )  is  convex. 

13.22.  Correlated  vs.  Nash  Equilibrium 
Consider  the  bimatrix  game 


\[
(A,B) = \begin{pmatrix} 6,6 & 2,7 \\ 7,2 & 0,0 \end{pmatrix}
\]


and  the  correlated  strategy 


(a)  Compute  all  Nash  equilibria  of  (A,  B). 

(b)  Show  that  P  is  a  correlated  equilibrium  and  that  the  associated  payoffs  fall 
outside  the  convex  hull  of  the  payoff  pairs  associated  with  the  Nash  equilibria 
of  (A,B). 

13.23.  Correlated  Equilibria  in  the  Two-Driver  Example  (2) 

Consider  again  the  game  of  Problem  13.19  and  set  up  the  associated  matrix  C  as  in 
Sect.  13.7.  Show  that  the  value  of  the  matrix  game  C  is  equal  to  3,  and  that  player  1  in 
C  has  a  unique  optimal  strategy.  (Hence,  this  method  gives  one  particular  correlated 
equilibrium.) 

13.24.  Finding  Correlated  Equilibria 

Compute  (the)  correlated  equilibria  in  the  following  game  directly,  and  by  using  the 
associated  matrix  game. 


\[
(A,B) = \begin{array}{c|cc} & 1' & 2' \\ \hline 1 & 5,2 & 1,3 \\ 2 & 2,3 & 4,1 \end{array}
\]


13.25.  Nash,  Perfect,  Proper,  Strictly  Perfect,  and  Correlated  Equilibria 

Consider the bimatrix game

\[
(A,B) = \begin{pmatrix} 0,6 & 0,4 & 6,0 \\ 4,0 & 0,0 & 4,0 \\ 6,0 & 0,4 & 0,6 \end{pmatrix}.
\]

Let $(p, q) \in \Delta^3 \times \Delta^3$ be a Nash equilibrium in $(A, B)$.

(a) Show that, if $p_1 = 0$, then $p = (0, 1, 0)$.

(b) Show that, if $p_1 > 0$ and $p_3 > 0$, then $q = (0, 1, 0)$.

(c)  Show  that  in  each  Nash  equilibrium  at  least  one  player  has  payoff  0. 

(d) Compute all Nash equilibria of $(A, B)$.

(e) Which Nash equilibria are perfect?

(f) Which Nash equilibria are proper? Strictly perfect?

Consider the correlated strategy

\[
P = \begin{pmatrix} 0 & \alpha & 0 \\ \beta & 0 & \gamma \\ 0 & \delta & 0 \end{pmatrix}
\]

with $\alpha, \beta, \gamma, \delta > 0$ and $\alpha + \beta + \gamma + \delta = 1$.

(g)  Under  which  conditions  is  P  a  correlated  equilibrium? 

(h)  Find  a  correlated  equilibrium  with  payoff  3  for  player  1  and  1  for  player  2. 

13.26.  Independence  of  the  Axioms  in  Corollary  13.40 

Show  that  the  three  conditions  in  Corollary  13.40  are  independent:  for  each  pair  of 
conditions,  exhibit  a  solution  that  satisfies  these  two  conditions  but  not  the  third  one. 

13.27.  Inconsistency  of  Perfect  Equilibria 
Consider the 3-person game $G_0$

\[
\begin{array}{c|cc} & L & R \\ \hline T & 1,1,1 & 1,0,1 \\ B & 1,1,1 & 0,0,1 \end{array}
\qquad
\begin{array}{c|cc} & L & R \\ \hline T & 0,1,0 & 0,0,0 \\ B & 1,1,0 & 0,0,0 \end{array}
\]

where player 1 chooses rows, player 2 columns, and player 3 matrices. Let $\Gamma$ consist
of this game and all its reduced games. Use this collection to show that the perfect
Nash equilibrium correspondence is not consistent.


13.10  Notes 

The  result  that  every  finite  game  has  a  Nash  equilibrium  in  mixed  strategies  is 
due  to  Nash  (1951).  For  the  quadratic  programming  problem  in  Sect.  13.2.3,  see 
Mangasarian  and  Stone  (1964).  Lemke  and  Howson  (1964)  provide  an  algorithm  to 
find  at  least  one  Nash  equilibrium  of  a  bimatrix  game.  See  von  Stengel  (2002)  for 
an  overview. 

For  the  first  proof  of  Theorem  13.10  see  Zermelo  (1913).  Theorem  13.13  is  due 
to  Pearce  (1984).  For  the  concept  of  rationalizability,  see  Bernheim  (1984)  and 
Pearce  (1984).  For  a  study  of  the  assumptions  underlying  the  procedure  of  iterated 
elimination  of  strictly  dominated  strategies  see  Tan  and  Werlang  (1988)  or  Perea 
(2001).  Perea  (2012)  presents  a  recent  overview  on  epistemic  game  theory. 

The concept of perfect equilibrium (trembling hand perfection) is due to Selten
(1975). The notions of $\varepsilon$-perfect and ($\varepsilon$-)proper equilibrium were introduced in
Myerson (1978). Theorem 13.20 is based on Selten (1975) and Myerson (1978)
and appears as Theorem 2.2.5 in van Damme (1991). The notion of strictly perfect
equilibrium was introduced in Okada (1981).

Correlated equilibria were introduced in Aumann (1974). Our treatment of the
topic closely follows the presentation in Owen (1995); Example 13.34 corresponds
to Example VIII.4.4 in that book.

For  theories  of  strategic  learning,  see  Young  (2004).  Section  13.8  is  based  on 
Peleg  and  Tijs  (1996);  the  presentation  there  is  for  more  general  games.  See  Norde 
et  al.  (1996)  for  a  study  of  the  notion  of  consistency  for  refinements  of  Nash 
equilibrium. 

Problem  13.12  is  from  Watson  (2002).  Problem  13.13  is  from  Fudenberg 
and  Tirole  (1991).  Problems  13.15  and  13.17  are  based  on  van  Damme  (1991). 
Problem 13.22 is based on Aumann (1974).

A game in extensive form specifies when each player in the game has to move, what
his  information  is  about  the  sequence  of  previous  moves,  which  chance  moves  occur, 
and  what  the  final  payoffs  are.  Such  games  are  discussed  in  Chaps.  4  and  5,  and  also 
occur  in  Chaps.  6  and  7.  The  present  chapter  extends  the  material  introduced  in 
Chaps.  4  and  5,  and  it  may  be  useful  to  (re)read  these  chapters  before  continuing. 

Section  14.1  formally  introduces  extensive  form  structures  and  games,  and 
Sect. 14.2 introduces behavioral strategies and studies the relation between behavioral
and mixed strategies. Section 14.3 is on Nash equilibrium and its main
refinements,  namely  subgame  perfect  equilibrium  and  sequential  equilibrium.  The 
latter  concept  generalizes  perfect  Bayesian  equilibrium. 


14.1 Extensive Form Structures and Games

An extensive form game is based on a directed rooted tree. A directed rooted tree is
a pair $T = (X, E)$, where:

• $X$ is a finite set with $|X| \geq 2$. The elements of $X$ are called nodes.

• $E$ is a subset of $X \times X$. The elements of $E$ are called edges. An edge $e = (x, y) \in E$
is called an outgoing edge of $x$ and an ingoing edge of $y$.

• There is an $x_0 \in X$, called the root, such that for each $x \in X \setminus \{x_0\}$ there
is a unique path from $x_0$ to $x$. Here, a path from $x_0$ to $x$ is a series of edges
$(x_0, x_1), (x_1, x_2), \ldots, (x_{k-1}, x_k), (x_k, x)$ for some $k \geq 0$.

• $x_0$ has no ingoing edges.

These conditions imply that each node that is not the root has exactly one ingoing
edge. Moreover, there are nodes which have no outgoing edges. These nodes are
called end nodes. The set of end nodes is denoted by $Z$ ($\subset X$).

An extensive form structure is a tuple $\mathcal{S} = (T, N, P, \mathcal{H}, \mathcal{A}, \tau)$, where:

• $T = (X, E)$ is a directed rooted tree with root $x_0$ and set of end nodes $Z$.

• $N = \{1, \ldots, n\}$ with $n \geq 1$ is the set of players.

• $P : X \setminus Z \to N \cup \{C\}$ is a function assigning to each non-end node either a player
or Chance $C$. If $P(x)$ is a player, then node $x$ is a decision node of player $P(x)$,
otherwise $x$ is a chance node.

• $\mathcal{H} = (H_i)_{i \in N}$ where for each $i \in N$, $H_i$ is a partition of the set $P^{-1}(i)$ of decision
nodes of player $i$. The sets $h \in H_i$ are called information sets of player $i$. Each
$h \in H_i$ is assumed to satisfy (i) every path in $T$ intersects $h$ at most once and (ii)
every node in $h$ has the same number of outgoing edges.

• $\mathcal{A} = (A(h))_{h \in H}$, where $H = \bigcup_{i \in N} H_i$ and for each $h \in H$, $A(h)$ is a partition
of the set of edges outgoing from nodes $x \in h$. The partition $A(h)$ is such that
for each $x \in h$ and each $a \in A(h)$, $a$ contains exactly one edge outgoing from $x$.
Every set $a \in A(h)$ is called an action at $h$. It is assumed that $|A(h)| \geq 2$ for each
$h \in H$.

• $\tau$ assigns to each chance node a probability distribution over the set of outgoing
edges, where it is assumed that all these probabilities are positive.

In  Fig.  14.1 — which  is  a  partial  reproduction  of  Fig.  4.1 — these  concepts  are 
illustrated. 

In this extensive form structure, the directed rooted tree has 14 nodes
$x_0, x_1, \ldots, x_{13}$ and, consequently, 13 edges. The set of end nodes is $Z =
\{x_6, \ldots, x_{13}\}$ and the player set is $N = \{1, 2\}$. The function $P : \{x_0, \ldots, x_5\} \to
\{1, 2\} \cup \{C\}$ is defined by $P(x) = 1$ for $x \in \{x_0, x_5\}$, $P(x) = 2$ for $x \in \{x_2, x_3, x_4\}$,
and $P(x_1) = C$. The information sets are $\{x_0\}, \{x_5\} \in H_1$ and $\{x_2, x_3\}, \{x_4\} \in H_2$.
The actions for player 1 are: $\{(x_0, x_2)\}, \{(x_0, x_3)\}, \{(x_0, x_1)\} \in A(\{x_0\})$ and
$\{(x_5, x_{12})\}, \{(x_5, x_{13})\} \in A(\{x_5\})$. The actions for player 2 are: $\{(x_2, x_6), (x_3, x_8)\},
\{(x_2, x_7), (x_3, x_9)\} \in A(\{x_2, x_3\})$ and $\{(x_4, x_{10})\}, \{(x_4, x_{11})\} \in A(\{x_4\})$. Finally,
$\tau(x_1) = (1/4, 3/4)$, where $1/4$ is the probability of $(x_1, x_4)$ and $3/4$ is the
probability of $(x_1, x_5)$.
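
As a sanity check on these definitions, the tree conditions can be verified mechanically. The sketch below encodes the 13 edges listed above, writing node $x_k$ as the integer `k`. The function name `end_nodes_of_rooted_tree` and the variable names are illustrative assumptions; the check relies on the fact that "no ingoing edge at the root, exactly one ingoing edge elsewhere, every node reachable from the root" is equivalent to the unique-path condition.

```python
from collections import defaultdict

def end_nodes_of_rooted_tree(nodes, edges, root):
    """Validate the rooted-tree conditions and return the set of end nodes."""
    nodes = set(nodes)
    ingoing, outgoing = defaultdict(list), defaultdict(list)
    for x, y in edges:
        outgoing[x].append(y)
        ingoing[y].append(x)
    if ingoing[root]:
        raise ValueError("the root must have no ingoing edges")
    for x in nodes - {root}:
        if len(ingoing[x]) != 1:
            raise ValueError(f"node {x} must have exactly one ingoing edge")
    # Walk down from the root; together with the in-degree check this
    # gives a unique path from the root to every node.
    reached, stack = {root}, [root]
    while stack:
        for y in outgoing[stack.pop()]:
            if y not in reached:
                reached.add(y)
                stack.append(y)
    if reached != nodes:
        raise ValueError("some node is not reachable from the root")
    return {x for x in nodes if not outgoing[x]}

# Nodes x0..x13 of Fig. 14.1 are written as the integers 0..13.
nodes = range(14)
edges = [(0, 1), (0, 2), (0, 3), (1, 4), (1, 5), (2, 6), (2, 7),
         (3, 8), (3, 9), (4, 10), (4, 11), (5, 12), (5, 13)]
Z = end_nodes_of_rooted_tree(nodes, edges, root=0)

# Condition (ii) on information sets: equal numbers of outgoing edges.
out_degree = {x: sum(1 for (u, _) in edges if u == x) for x in nodes}
info_sets = [{0}, {5}, {2, 3}, {4}]
same_out_degree = all(len({out_degree[x] for x in h}) == 1 for h in info_sets)
```

Here `Z` should coincide with the end nodes $x_6, \ldots, x_{13}$ named in the text.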


Fig. 14.1 An extensive form structure

Fig. 14.2 Part of an extensive form structure without perfect recall


Clearly,  this  formal  notation  is  quite  cumbersome  and  we  try  to  avoid  it  as  much 
as  possible.  It  is  only  needed  to  give  precise  definitions  and  proofs. 

It  is  usually  assumed  that  an  extensive  form  structure  S  satisfies  perfect  recall : 
this  means  that  each  player  always  remembers  what  he  did  in  the  past.  The  formal 
definition  is  as  follows. 

Definition 14.1 An extensive form structure $\mathcal{S}$ satisfies perfect recall for player
$i \in N$ if for every information set $h \in H_i$ and each pair of nodes $x, y \in h$, player $i$'s
outgoing edges on the path from the root to $x$ belong to the same player $i$ actions as
player $i$'s outgoing edges on the path from the root to $y$. In other words, an action of
player $i$ is on the path to $x$ if and only if it is on the path to $y$. We say that $\mathcal{S}$ satisfies
perfect recall if it satisfies perfect recall for every player. □

Figure  14.2  shows  part  of  an  extensive  form  structure  without  perfect  recall. 

The  condition  of  perfect  recall  plays  an  important  role  for  the  relation  between 
mixed  and  behavioral  strategies  (see  Sect.  14.2). 

We  also  repeat  the  definitions  of  perfect  and  imperfect  information  (cf.  Chap.  4). 

Definition 14.2 An extensive form structure $\mathcal{S}$ has perfect information if for every
$i \in N$ and $h \in H_i$, $|h| = 1$. Otherwise, $\mathcal{S}$ has imperfect information. □

We conclude this section with the formal definition of an extensive form game. An
extensive form game $\Gamma$ is an $(n+1)$-tuple $\Gamma = (\mathcal{S}, u_1, \ldots, u_n)$, where $\mathcal{S}$ is an
extensive form structure and for each player $i \in N$, $u_i : Z \to \mathbb{R}$. The function $u_i$ is
player $i$'s payoff function.

A game $\Gamma = (\mathcal{S}, u_1, \ldots, u_n)$ has (im)perfect information if $\mathcal{S}$ has (im)perfect
information.


14.2  Pure,  Mixed  and  Behavioral  Strategies 

Let $\mathcal{S} = (T, N, P, \mathcal{H}, \mathcal{A}, \tau)$ be an extensive form structure. A pure strategy $s_i$ of
player $i \in N$ is a map assigning an action $a \in A(h)$ to every information set $h \in H_i$.
By $S_i$ we denote the (finite) set of pure strategies of player $i$.

Any pure strategy combination $(s_1, \ldots, s_n)$, when played, results in a (unique)
probability distribution over the end nodes of the game tree. (Of course, if there
are no chance moves then this probability distribution is degenerate.) Therefore,
with each extensive form game $\Gamma = (\mathcal{S}, u_1, \ldots, u_n)$ we can associate (with a slight
abuse of notation) a strategic form game $G(\Gamma) = (S_1, \ldots, S_n, u_1, \ldots, u_n)$ in the
obvious way: if the pure strategy combination $(s_1, \ldots, s_n)$ generates a probability
distribution $(q_z)_{z \in Z}$ over the end nodes of the game tree, then player $i$ receives
$\sum_{z \in Z} q_z u_i(z)$. A (mixed) strategy of player $i \in N$ is an element of $\Delta(S_i)$, i.e., a
probability distribution over the elements of $S_i$.
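
Since a pure strategy picks one action at every information set of a player, the set $S_i$ can be enumerated as a Cartesian product, and its size is the product of the numbers $|A(h)|$ over $h \in H_i$. A minimal Python sketch for the structure of Fig. 14.1 follows; the function name and the action labels (each action is named after the node or nodes it leads to) are illustrative assumptions.

```python
from itertools import product

def pure_strategies(actions_at):
    """All maps that pick one action at every information set."""
    info_sets = sorted(actions_at)
    return [dict(zip(info_sets, choice))
            for choice in product(*(actions_at[h] for h in info_sets))]

# Information sets of Fig. 14.1 with hypothetical action labels.
player1 = {"{x0}": ["x1", "x2", "x3"], "{x5}": ["x12", "x13"]}
player2 = {"{x2,x3}": ["x6/x8", "x7/x9"], "{x4}": ["x10", "x11"]}

S1 = pure_strategies(player1)   # 3 * 2 pure strategies
S2 = pure_strategies(player2)   # 2 * 2 pure strategies
```

Note that a pure strategy also prescribes an action at information sets that its own earlier choices make unreachable, which is why the count is a full product.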

When  considering  an  extensive  form  game  (structure)  it  seems  more  natural  to 
consider,  instead  of  mixed  strategies,  so-called  behavioral  strategies.  A  behavioral 
strategy  of  a  player  assigns  to  each  information  set  of  that  player  a  probability 
distribution  over  the  actions  at  that  information  set.  Formally,  we  have  the  following 
definition. 

Definition 14.3 Let $\mathcal{S} = (T, N, P, \mathcal{H}, \mathcal{A}, \tau)$ be an extensive form structure. A
behavioral strategy of player $i \in N$ is a map $b_i$ assigning to each information set
$h \in H_i$ a probability distribution over the set of actions $A(h)$. □

Given a behavioral strategy there is an obvious way to define an associated mixed
strategy: for each pure strategy, simply multiply all probabilities assigned by the
behavioral strategy to the actions occurring in the pure strategy. Consider for
instance the extensive form structure in Fig. 14.3. In this diagram a behavioral
strategy $b_1$ of player 1 is indicated. The associated mixed strategy $\sigma_1$ assigns the
probabilities $\sigma_1(A, l) = b_1(A)\,b_1(l)$, $\sigma_1(A, r) = b_1(A)\,b_1(r)$, $\sigma_1(B, l) = b_1(B)\,b_1(l)$, and
$\sigma_1(B, r) = b_1(B)\,b_1(r)$, with the values of $b_1$ as indicated in the figure. Strategy $\sigma_1$ is
the 'right' mixed strategy associated with $b_1$ in the following sense. Suppose player 2
plays the mixed or behavioral strategy (there is no difference in this case) which puts
probability $\alpha$ on $L$ and $1 - \alpha$ on $R$. If player 1 plays the behavioral strategy $b_1$ then
the probability distribution generated over the end nodes of the game is
$x_5 \mapsto b_1(A)\,b_1(l) \cdot \alpha$, $x_6 \mapsto b_1(A)\,b_1(r) \cdot \alpha$, $x_7 \mapsto b_1(A)\,b_1(l) \cdot (1 - \alpha)$,
$x_8 \mapsto b_1(A)\,b_1(r) \cdot (1 - \alpha)$. The same distribution is generated by the mixed strategy $\sigma_1$.
For example, the probability that $x_5$ is reached equals $\sigma_1(A, l) \cdot \alpha = b_1(A)\,b_1(l) \cdot \alpha$, etc.
We call $b_1$ and $\sigma_1$ outcome equivalent. Obviously, it would have been sufficient to
check this for the two pure strategies $L$ and $R$ for player 2.

Fig. 14.3 From behavioral to mixed strategies

We  summarize  these  considerations  in  a  definition  and  a  proposition,  which  is 
presented  without  a  formal  proof. 

Definition 14.4 Two (behavioral or mixed) strategies of player $i$ in $\mathcal{S}$ are outcome
equivalent if for each pure strategy combination $s_{-i}$ of the other players the
probability distributions generated by the two strategies over the end nodes are
equal. □

Proposition 14.5 Let $b_i$ be a behavioral strategy of player $i$ in $\mathcal{S}$. Then there is
a mixed strategy $\sigma_i$ of player $i$ that is outcome equivalent to $b_i$. Such a strategy
$\sigma_i$ is obtained by assigning to each pure strategy $s_i$ of player $i$ the product of the
probabilities assigned by $b_i$ to the actions chosen by $s_i$.
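
The product construction in Proposition 14.5 can be sketched in a few lines of Python. The two information sets, the action names, and in particular the numerical probabilities below are hypothetical values chosen for illustration; they are not claimed to be those of any figure in the text.

```python
from fractions import Fraction as F
from itertools import product

def behavioral_to_mixed(b):
    """Map {information set: {action: prob}} to {pure strategy (tuple of actions): prob}."""
    info_sets = list(b)
    sigma = {}
    for choice in product(*(b[h] for h in info_sets)):
        prob = F(1)
        for h, a in zip(info_sets, choice):
            prob *= b[h][a]          # multiply the probabilities of the chosen actions
        sigma[choice] = prob
    return sigma

# Hypothetical behavioral strategy of a player with two information sets.
b1 = {"first": {"A": F(1, 4), "B": F(3, 4)},
      "second": {"l": F(2, 3), "r": F(1, 3)}}
sigma1 = behavioral_to_mixed(b1)
```

With these assumed numbers, the pure strategy $(A, l)$ receives probability $\tfrac14 \cdot \tfrac23 = \tfrac16$, and the four probabilities sum to 1.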

It should be noted that there is not necessarily a unique mixed strategy that is
outcome equivalent to a given behavioral strategy. For instance, in the example
above, if we change the behavioral strategy of player 1 such that it assigns
probability 0 to action $A$ and probability 1 to action $B$ at information set $\{x_0\}$,
then all mixed strategies which put zero probability on $(A, l)$ and $(A, r)$ are outcome
equivalent: each end node other than $x_2$ is reached with probability zero.

Also for the converse question of how to associate an outcome equivalent
behavioral strategy with a mixed strategy, it is not hard to figure out a procedure.
Suppose that $\sigma_i$ is a mixed strategy of player $i$, $h$ is an information set of player $i$,
and $a$ is an action in $h$. First, let $S_i(h)$ denote the set of pure strategies of player $i$
such that the play of the game possibly reaches $h$, in other words, such that there
exists a path through $h$ containing the actions prescribed by the pure strategy under
consideration. Then $\sigma_i(S_i(h))$ is the total probability assigned by $\sigma_i$ to this set of
pure strategies. Within this set, consider those pure strategies that assign $a$ to $h$
and divide their total probability by $\sigma_i(S_i(h))$ if $\sigma_i(S_i(h)) > 0$: the result is defined
to be $b_i(h)(a)$. Thus, $b_i(h)(a)$ is the conditional probability of $a$ being played, given
that the set $h$ is reached. If $\sigma_i(S_i(h)) = 0$ then we
can choose $b_i(h)$ arbitrarily. This way, we construct a behavioral strategy $b_i$ that is
outcome equivalent to the mixed strategy $\sigma_i$.

As  an  illustration  consider  the  extensive  form  structure  in  Fig.  14.4,  which  is  the 
same  as  the  one  in  Fig.  14.3. 

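Consider the mixed strategy $\sigma_1$ of player 1 defined by: $(A, l) \mapsto \frac{1}{5}$, $(A, r) \mapsto \frac{1}{10}$,
$(B, l) \mapsto \frac{2}{5}$, $(B, r) \mapsto \frac{3}{10}$. Following the above procedure we obtain

\[
\begin{aligned}
b_1(A) &= \frac{\sigma_1(A,l) + \sigma_1(A,r)}{1} = \frac{1}{5} + \frac{1}{10} = \frac{3}{10}, \\
b_1(B) &= \frac{\sigma_1(B,l) + \sigma_1(B,r)}{1} = \frac{2}{5} + \frac{3}{10} = \frac{7}{10}, \\
b_1(l) &= \frac{\sigma_1(A,l)}{\sigma_1(A,l) + \sigma_1(A,r)} = \frac{1/5}{1/5 + 1/10} = \frac{2}{3}, \\
b_1(r) &= \frac{\sigma_1(A,r)}{\sigma_1(A,l) + \sigma_1(A,r)} = \frac{1/10}{1/5 + 1/10} = \frac{1}{3}.
\end{aligned}
\]

Fig. 14.4 From mixed to behavioral strategies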


It is straightforward to verify that $b_1$ and $\sigma_1$ are outcome equivalent.
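
The verification can be carried out numerically. The sketch below recomputes the behavioral strategy by the conditional-probability procedure and compares the end-node distributions against both pure strategies of player 2. It assumes the tree shape suggested by the text: $B$ ends the game, while after $A$ player 2 chooses $L$ or $R$ and player 1 then chooses $l$ or $r$ without observing that choice. End nodes are represented by the sequence of actions leading to them; all function names are illustrative.

```python
from fractions import Fraction as F

sigma1 = {("A", "l"): F(1, 5), ("A", "r"): F(1, 10),
          ("B", "l"): F(2, 5), ("B", "r"): F(3, 10)}

# Every pure strategy reaches the root; only those choosing A can reach
# player 1's second information set.
reach_second = sigma1[("A", "l")] + sigma1[("A", "r")]
b1_A = reach_second / sum(sigma1.values())
b1_l = sigma1[("A", "l")] / reach_second

def outcome_mixed(sigma, s2):
    """End-node distribution when player 1 mixes with sigma and player 2 plays s2."""
    dist = {}
    for (first, second), prob in sigma.items():
        end = ("B",) if first == "B" else ("A", s2, second)
        dist[end] = dist.get(end, F(0)) + prob
    return dist

def outcome_behavioral(b_A, b_l, s2):
    """End-node distribution when player 1 uses the behavioral strategy (b_A, b_l)."""
    return {("B",): 1 - b_A,
            ("A", s2, "l"): b_A * b_l,
            ("A", s2, "r"): b_A * (1 - b_l)}

equivalent = all(outcome_mixed(sigma1, s2) == outcome_behavioral(b1_A, b1_l, s2)
                 for s2 in ("L", "R"))
```

Checking the two pure strategies of player 2 suffices, in line with Definition 14.4.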

Outcome  equivalence  is  not  guaranteed  without  perfect  recall:  see  Problem  14.2 
for  an  example.  With  perfect  recall,  we  have  the  following  theorem. 

Theorem 14.6 (Kuhn) Let the extensive form structure $\mathcal{S}$ satisfy perfect recall.
Then, for every player $i$ and every mixed strategy $\sigma_i$ there is a behavioral strategy $b_i$
that is outcome equivalent to $\sigma_i$.

The  behavioral  strategy  bi  in  Theorem  14.6  can  be  constructed  as  described  in  the 
text. 

14.3  Nash  Equilibrium  and  Refinements 

Let $\Gamma = (\mathcal{S}, (u_i)_{i \in N})$ be an extensive form game with associated strategic form
game $G(\Gamma) = ((S_i)_{i \in N}, (u_i)_{i \in N})$. We assume that $\mathcal{S}$ satisfies perfect recall.

A pure strategy Nash equilibrium of $\Gamma$ is defined to be a pure strategy Nash
equilibrium of $G(\Gamma)$. Note that, if $\Gamma$ has perfect information, then a pure strategy
Nash equilibrium exists (cf. Chap. 4).

A mixed strategy Nash equilibrium of $\Gamma$ is defined to be a (mixed strategy) Nash
equilibrium of $G(\Gamma)$. By Theorem 13.1 such an equilibrium always exists.

Consider, now, a behavioral strategy combination $b = (b_i)_{i \in N}$ in $\Gamma$. Such a
strategy combination generates a probability distribution over the end nodes and,
thus, an expected payoff for each player. We call $b_i$ a best reply of player $i \in N$
to the strategy combination $b_{-i}$ if there is no other behavioral strategy $b_i'$ of player
$i$ such that $(b_i', b_{-i})$ generates a higher expected payoff for player $i$. We call $b$ a
Nash equilibrium (in behavioral strategies) of $\Gamma$ if $b_i$ is a best reply to $b_{-i}$ for every
player $i \in N$. (Thus, observe that a Nash equilibrium of an extensive form game is
by definition a Nash equilibrium in behavioral strategies.)

Let $\sigma$ be a mixed strategy Nash equilibrium of $\Gamma$. By Theorem 14.6 there is
a behavioral strategy combination $b$ that is outcome equivalent to $\sigma$. We claim
that $b$ is a Nash equilibrium of $\Gamma$. Suppose not; then there is a player $i \in N$ and
a behavioral strategy $b_i'$ that gives player $i$ a higher expected payoff against $b_{-i}$.
By Proposition 14.5 there is a mixed strategy $\sigma_i'$ that is outcome equivalent to $b_i'$.
Consequently, $\sigma_i'$ gives player $i$ a higher expected payoff against $\sigma_{-i}$, a contradiction.
We have thus proved:

Theorem  14.7  Every  extensive  form  game  has  a  Nash  equilibrium. 

In  fact,  a  similar  argument  as  the  one  leading  to  this  theorem  can  be  applied  to  show 
that  every  Nash  equilibrium  (in  behavioral  strategies)  results  in  a  Nash  equilibrium 
in  mixed  strategies.  Hence,  one  way  to  find  the  (behavioral  strategy)  Nash  equilibria 
of  an  extensive  form  game  is  to  determine  all  (mixed  strategy)  Nash  equilibria  of 
the  associated  strategic  form  game.  Which  way  is  most  convenient  depends  on  the 
game  at  hand.  In  particular  for  refinements  it  is  often  easier  to  compute  behavioral 
equilibrium  strategies  directly,  without  first  computing  the  mixed  strategy  Nash 
equilibria.  Before  discussing  these  refinements  we  first  consider  an  example. 

Example 14.8 Consider the extensive form game $\Gamma_1$ in Fig. 14.5.

This game is based on the extensive form structure of Fig. 14.4, in which the
symbols for the end nodes are replaced by payoffs for player 1 (upper number)
and player 2 (lower number). The associated strategic form of this game is given in
Table 14.1. (Note that in $G(\Gamma_1)$ there is no essential difference between $BC$ and $BD$.)


Fig. 14.5 The game $\Gamma_1$

Table 14.1 The strategic form $G(\Gamma_1)$ of $\Gamma_1$

\[
\begin{array}{c|cc}
 & L & R \\ \hline
AC & 8,0 & 0,8 \\
AD & 0,8 & 8,0 \\
BC & 6,0 & 6,0 \\
BD & 6,0 & 6,0
\end{array}
\]

To find the Nash equilibria in $G(\Gamma_1)$, first note that player 2 will never play pure
in a Nash equilibrium: against $L$ player 1's unique best reply is $AC$, to which $R$ is
player 2's unique best reply, and symmetrically for $R$ and $AD$. Suppose, in equilibrium,
that player 2 plays $(\alpha, 1 - \alpha)$ with $0 < \alpha < 1$, and player 1 plays $p = (p_1, p_2, p_3, p_4)$,
where $p_1$ is the probability of $AC$, $p_2$ the probability of $AD$, $p_3$ the probability of $BC$,
and $p_4$ the probability of $BD$. Since $0 < \alpha < 1$, player 2 is indifferent between $L$ and $R$,
which implies that $p_1 = p_2$. Suppose $p_1 = p_2 > 0$. Then we must have $8\alpha \geq 6$ and
$8(1 - \alpha) \geq 6$, which is impossible. Hence $p_3 + p_4 = 1$ and both $6 \geq 8\alpha$ and
$6 \geq 8(1 - \alpha)$, so $1/4 \leq \alpha \leq 3/4$. This implies that $b = (b_1, b_2)$ is a (behavioral
strategy) Nash equilibrium of $\Gamma_1$ if and only if

\[
b_1(A) = 0, \quad b_1(B) = 1, \quad 1/4 \leq b_2(L) \leq 3/4 .
\]

So $b_1(C)$ may take any arbitrary value. □
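
The conclusion of the example can be checked against the strategic form of Table 14.1. The following Python sketch tests the best-reply conditions directly; the function name `is_nash` and the particular test strategies are illustrative assumptions, and the check covers only the sampled points, not the full characterization.

```python
from fractions import Fraction as F

ROWS = ("AC", "AD", "BC", "BD")
U1 = {"AC": (8, 0), "AD": (0, 8), "BC": (6, 6), "BD": (6, 6)}  # player 1, columns (L, R)
U2 = {"AC": (0, 8), "AD": (8, 0), "BC": (0, 0), "BD": (0, 0)}  # player 2, columns (L, R)

def is_nash(p, alpha):
    """p: probabilities on the rows; alpha: probability that player 2 plays L."""
    q = (alpha, 1 - alpha)
    u1 = {r: U1[r][0] * q[0] + U1[r][1] * q[1] for r in ROWS}
    u2 = [sum(p[r] * U2[r][c] for r in ROWS) for c in (0, 1)]
    # Every strategy played with positive probability must be a best reply.
    ok1 = all(p[r] == 0 or u1[r] == max(u1.values()) for r in ROWS)
    ok2 = all(q[c] == 0 or u2[c] == max(u2) for c in (0, 1))
    return ok1 and ok2

only_B = {"AC": F(0), "AD": F(0), "BC": F(1, 2), "BD": F(1, 2)}
mix_A = {"AC": F(1, 2), "AD": F(1, 2), "BC": F(0), "BD": F(0)}
```

For instance, `only_B` passes the test at the boundary values $\alpha = 1/4$ and $\alpha = 3/4$ but fails at $\alpha = 1/5$, where $AD$ yields $32/5 > 6$.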

In  the  remainder  of  this  section  we  consider  refinements  of  Nash  equilibrium. 


14.3.1  Subgame  Perfect  Equilibrium 

Let $x$ be a non-end node in an extensive form structure $\mathcal{S}$ and let $T^x = (V^x, E^x)$ be
the subtree starting from $x$, i.e., $V^x$ is the subset of $X$ consisting of $x$ and all nodes
of $X$ that can be reached by a path starting from $x$, and $E^x$ is the subset of $E$ of all
edges between nodes in $V^x$. If every information set of $\mathcal{S}$ is contained either in $V^x$
or in $X \setminus V^x$ (this implies, in particular, that $\{x\}$ is a singleton information set), then
we call $\mathcal{S}^x$, the restriction of $\mathcal{S}$ to $T^x$, a substructure. Then, for the extensive form
game $\Gamma = (\mathcal{S}, (u_i)_{i \in N})$, the game $\Gamma^x = (\mathcal{S}^x, (u_i^x)_{i \in N})$ is defined by restricting the
payoff functions to the end nodes still available in $V^x$. We call $\Gamma^x$ a subgame of $\Gamma$.
For a behavioral strategy combination $b = (b_i)_{i \in N}$ we denote by $b^x = (b_i^x)_{i \in N}$ the
restriction to the substructure $\mathcal{S}^x$.

Definition 14.9 A behavioral strategy combination $b$ in $\Gamma$ is a subgame perfect
equilibrium if $b^x$ is a Nash equilibrium for every subgame $\Gamma^x$. □

Clearly,  this  definition  extends  the  definition  of  subgame  perfection  for  pure  strategy 
combinations  given  in  Chap.  4. 

Since the whole game $\Gamma$ is a subgame ($\Gamma = \Gamma^{x_0}$, where $x_0$ is the root of the
game tree), every subgame perfect equilibrium is a Nash equilibrium. By carrying
out a backward induction procedure as in Sect. 4.3, it can be seen that a subgame
perfect equilibrium exists in any extensive form game.

Subgame  perfection  often  implies  a  considerable  reduction  of  the  set  of  Nash 
equilibria.  The  following  example  is  the  continuation  of  Example  14.8. 

Example 14.10 To find the subgame perfect equilibria in $\Gamma_1$, we only have to
analyze the subgame $\Gamma_1^{x_1}$. It is easy to see that this subgame has a unique Nash
equilibrium, namely player 2 playing $L$ and $R$ each with probability 1/2, and player
1 playing $C$ and $D$ each with probability 1/2. This results in a unique subgame
perfect equilibrium $b = (b_1, b_2)$ given by

\[
b_1(B) = 1, \quad b_1(C) = 1/2, \quad b_2(L) = 1/2.
\]

To reach this conclusion we have used that in any Nash equilibrium player 1 plays
$B$, as established in Example 14.8. This, however, can also be seen from the fact that
the only Nash equilibrium in the subgame starting at $x_1$ gives player 1 a payoff of 4,
which is smaller than the payoff of 6 which player 1 obtains from playing $B$. In fact,
this argument is a kind of 'backward induction', where in the first step we analyze
the whole subgame $\Gamma_1^{x_1}$. □
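
The mixed equilibrium of the subgame follows from two indifference conditions. As a short worked derivation, write $\alpha = b_2(L)$ and $\gamma = b_1(C)$ (the symbol $\gamma$ is introduced here only for illustration), with payoffs read from Table 14.1:

```latex
\begin{align*}
\text{player 1 indifferent between } C \text{ and } D: &\quad 8\alpha = 8(1-\alpha) \;\Rightarrow\; \alpha = \tfrac{1}{2}, \\
\text{player 2 indifferent between } L \text{ and } R: &\quad 8(1-\gamma) = 8\gamma \;\Rightarrow\; \gamma = \tfrac{1}{2}, \\
\text{player 1's expected payoff in the subgame}: &\quad 8\alpha\gamma + 8(1-\alpha)(1-\gamma) = 2 + 2 = 4 < 6 .
\end{align*}
```

The last line is the comparison used in the backward induction step: entering the subgame is worth 4 to player 1, whereas $B$ yields 6.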

14.3.2  Perfect  Bayesian  and  Sequential  Equilibrium 

In  games  without  proper  subgames  and  in  games  of  imperfect  information  the 
subgame  perfection  requirement  may  not  have  much  bite  (see  the  examples  in 
Chaps. 4 and 5). The concepts of perfect Bayesian and sequential equilibrium
make it possible to distinguish between Nash equilibria by considering beliefs of
players on information sets.

Consider an extensive form structure $\mathcal{S} = (T, N, P, \mathcal{H}, \mathcal{A}, \tau)$. A belief system $\beta$
assigns to every information set $h \in H = \bigcup_{i \in N} H_i$ a probability distribution $\beta_h$ over
the nodes in $h$. An assessment is a pair $(b, \beta)$ of a behavioral strategy combination
$b = (b_i)_{i \in N}$ and a belief system $\beta$.

The first requirement we consider is that of sequential rationality, which requires
a player at an information set to choose only those actions that yield maximal
expected payoff, given that player's beliefs. Consider an extensive form game
$\Gamma = (\mathcal{S}, (u_i)_{i \in N})$. Let $(b, \beta)$ be an assessment. Let $i \in N$, $h \in H_i$, $a \in A(h)$,
and $x \in h$. Suppose player $i$ is at node $x$ and takes action $a$. This corresponds to
an edge $(x, y)$ in the game tree. Then each end node on a path starting from $x$ and
passing through $y$ is reached with a probability that is equal to the product of the
probabilities of all edges on this path following $y$, given by $b$ and the probabilities at
any chance nodes on this path. This way, we can compute the expected payoff
to player $i$ from playing $a$, conditional on being at node $x$: denote this payoff by
$u_i(a|b, x)$. Player $i$'s expected payoff from action $a$, given information set $h$, is then
equal to $\sum_{x \in h} \beta(x)\, u_i(a|b, x)$.

Definition 14.11 An assessment $(b, \beta)$ in $\Gamma = (\mathcal{S}, (u_i)_{i \in N})$ is sequentially rational
if for every $i \in N$, $h \in H_i$, and $a \in A(h)$ we have:

\[
b_i(h)(a) > 0 \;\Rightarrow\; \sum_{x \in h} \beta(x)\, u_i(a|b, x) = \max_{a' \in A(h)} \sum_{x \in h} \beta(x)\, u_i(a'|b, x) .
\]

□

Thus, indeed, sequential rationality of an assessment $(b, \beta)$ means that a player puts
only positive probability on those actions at an information set $h$ that maximize his
expected payoff, given $h$ and his belief $\{\beta(x) \mid x \in h\}$.
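
At a single information set, the condition of Definition 14.11 amounts to comparing belief-weighted payoffs. The Python sketch below does this for player 1's last information set in the game of Table 14.1, where the conditional payoffs are simply the terminal payoffs because no moves follow. The node names `after_L` and `after_R`, the function name, and the two belief vectors are illustrative assumptions.

```python
from fractions import Fraction as F

def sequentially_rational_at(beliefs, cond_payoff, b_h):
    """beliefs: {node: belief}; cond_payoff: {action: {node: payoff}}; b_h: {action: prob}."""
    expected = {a: sum(beliefs[x] * cond_payoff[a][x] for x in beliefs)
                for a in cond_payoff}
    best = max(expected.values())
    # Only actions with maximal expected payoff may get positive probability.
    return all(prob == 0 or expected[a] == best for a, prob in b_h.items())

# Player 1's payoffs after A, depending on whether player 2 chose L or R.
cond_payoff = {"C": {"after_L": 8, "after_R": 0},
               "D": {"after_L": 0, "after_R": 8}}
uniform = {"after_L": F(1, 2), "after_R": F(1, 2)}
skewed = {"after_L": F(3, 4), "after_R": F(1, 4)}
mix = {"C": F(1, 2), "D": F(1, 2)}
pure_C = {"C": F(1), "D": F(0)}
```

Under the uniform belief both actions yield 4, so any mixture is sequentially rational there; under the skewed belief only $C$ is.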

Sequential  rationality  is  defined  with  respect  to  a  given  belief  system.  We  now 
formulate  conditions  on  such  belief  systems  which,  in  turn,  are  defined  with  respect 
to  a  given  profile  of  behavioral  strategies. 

Let $x$ be a node in the game tree $T$ such that $\mathcal{S}^x$ is a substructure. For any node
$x'$ in the subtree $T^x$, let $\mathbb{P}^x_b(x')$ denote the probability that $x'$ is reached in $T^x$ given
$b$, that is, $\mathbb{P}^x_b(x')$ is the product of the probabilities of all edges on the unique path
from $x$ to $x'$; these probabilities are given by $b$, or by $\tau$ in case of a chance node on
the path. For every information set $h$ of $\mathcal{S}^x$, $\mathbb{P}^x_b(h) = \sum_{x' \in h} \mathbb{P}^x_b(x')$ is the probability
that, within $\mathcal{S}^x$, the information set $h$ is reached, given $b$.

The next definition is the formal version of the consistency requirement introduced
already in Chap. 4.


Definition 14.12 An assessment $(b, \beta)$ in $\mathcal{S}$ is Bayesian consistent if $\beta_h(x') =
\mathbb{P}^x_b(x') / \mathbb{P}^x_b(h)$ for all $x' \in h$, for each substructure $\mathcal{S}^x$ and each information set $h$
in $\mathcal{S}^x$ for which $\mathbb{P}^x_b(h) > 0$. □

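It is not difficult to see that, if $h$ is an information set within a substructure $\mathcal{S}^x$ and
$\mathcal{S}^x$ is in turn a substructure of another substructure $\mathcal{S}^y$, such that $\mathbb{P}^y_b(h) > 0$, then
$\mathbb{P}^x_b(h) > 0$ and $\mathbb{P}^x_b(x') / \mathbb{P}^x_b(h) = \mathbb{P}^y_b(x') / \mathbb{P}^y_b(h)$ for all $x' \in h$ (indeed,
$\mathbb{P}^y_b(x') = \mathbb{P}^y_b(x)\, \mathbb{P}^x_b(x')$ for every node $x'$ of $T^x$). In other words, if
beliefs are restricted by Bayesian consistency, this restriction does not depend on
the particular substructure (or subgame) considered.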
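
A small numerical illustration, under stated assumptions: take the game of Table 14.1, in which (as the text indicates) the node $x_1$ reached after $A$ starts a subgame, and consider player 1's last information set with nodes here called `after_L` and `after_R` (hypothetical names). The Python sketch computes Bayesian beliefs as normalized reach probabilities, once in the whole game and once in the subgame.

```python
from fractions import Fraction as F

def bayes_beliefs(reach):
    """Normalize reach probabilities; None means the beliefs are not restricted."""
    total = sum(reach.values())
    if total == 0:
        return None
    return {x: p / total for x, p in reach.items()}

def reach_in_whole_game(b1_A, b2_L):
    # Both nodes are reached only if player 1 chooses A at the root.
    return {"after_L": b1_A * b2_L, "after_R": b1_A * (1 - b2_L)}

def reach_in_subgame(b2_L):
    # Within the subgame starting after A, only player 2's choice matters.
    return {"after_L": b2_L, "after_R": 1 - b2_L}

whole = bayes_beliefs(reach_in_whole_game(F(3, 10), F(2, 3)))
sub = bayes_beliefs(reach_in_subgame(F(2, 3)))
unreached = bayes_beliefs(reach_in_whole_game(F(0), F(2, 3)))
```

The common factor $b_1(A)$ cancels, so `whole` and `sub` coincide. When $b_1(A) = 0$ the whole game imposes no restriction (`unreached` is `None`), but the subgame still pins the beliefs down, which is why Definition 14.12 quantifies over all substructures.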

By  Bayesian  consistency  the  players’  beliefs  are  determined,  as  conditional 
probabilities,  by  the  behavioral  strategies  on  all  information  sets  that  are  reached 
with  positive  probability  in  some  subgame.  This  requirement  can  be  quite  weak;  for 
instance,  it  does  not  even  imply  that  the  beliefs  of  one  and  the  same  player  have 
to  be  internally  consistent,  as  is  illustrated  by  the  following  example.  Consider  the 
extensive  form  structure  in  Fig.  14.6.  Suppose  that  player  1  plays  a  and  player  2 
plays $e$. Then the beliefs of player 2 at the information sets $\{x_1, x_2\}$ and $\{x_3, x_4\}$ are
not  restricted  by  Bayesian  consistency:  the  only  substructure  is  the  whole  structure, 
within  which  the  information  sets  of  player  2  are  reached  with  zero  probability. 


Fig. 14.6 If player 1 plays $a$, then player 2's beliefs at $\{x_1, x_2\}$ and $\{x_3, x_4\}$ are independent under Bayesian consistency

Moreover, a belief $\beta(x_1) = \beta(x_4) = 1$ is allowed, which means that player 2's beliefs are not internally consistent.

In many applications this drawback does not occur, and Bayesian consistency is strong enough. For instance, in signaling games (see Chap. 5) Bayesian consistency implies the stronger version of consistency in Definition 14.13 below. See Problem 14.4.

Call a behavioral strategy $b_i$ of player $i$ completely mixed if $b_i(h)(a) > 0$ for each $h \in H_i$ and each action $a \in A(h)$. A behavioral strategy combination $b = (b_i)_{i \in N}$ is completely mixed if $b_i$ is completely mixed for every $i \in N$. Observe that, if $b$ is completely mixed and the assessment $(b, \beta)$ is Bayesian consistent, then $\beta$ is uniquely determined by $b$: all information sets in all substructures are reached with positive probability. The announced stronger version of consistency, simply called consistency, is defined as follows.

Definition 14.13 An assessment $(b, \beta)$ in $S$ is consistent if there exists a sequence $(b^m, \beta^m)_{m \in \mathbb{N}}$ of Bayesian consistent assessments with each $b^m$ completely mixed and $\lim_{m \to \infty} (b^m, \beta^m) = (b, \beta)$. □

Consistency implies Bayesian consistency (Problem 14.3) and is, in general, strictly stronger. For instance, in the extensive form structure of Fig. 14.6, it is easily seen that consistency requires player 2 to have identical beliefs on his two information sets, i.e., $\beta(x_1) = \beta(x_3)$. This is true even if on his right information set player 2 is replaced by some other player 3, as in Fig. 14.7.
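The claim that consistency forces identical beliefs can be checked numerically. The sketch below assumes a tree shape inferred from the text around Fig. 14.6: player 1's actions b and c lead to x1 and x2, and player 2's action e leads from x1 to x3 and from x2 to x4. This shape, the function name, and the particular trembles are assumptions of the sketch.

```python
from fractions import Fraction

def beliefs_fig_14_6(b1_b, b1_c, b2_e):
    """Bayesian consistent beliefs (beta(x1), beta(x3)) for completely
    mixed strategies, under the assumed tree shape."""
    reach_x1, reach_x2 = b1_b, b1_c                 # reached via b and c
    reach_x3, reach_x4 = b1_b * b2_e, b1_c * b2_e   # then e is played
    beta_x1 = reach_x1 / (reach_x1 + reach_x2)
    beta_x3 = reach_x3 / (reach_x3 + reach_x4)
    return beta_x1, beta_x3

# Completely mixed trembles converging to "player 1 plays a".
alpha = Fraction(1, 3)
sequence = [
    beliefs_fig_14_6(alpha / m, (1 - alpha) / m, Fraction(1, m))
    for m in range(2, 50)
]
```

Player 2's tremble cancels in the belief at x3, so both beliefs equal alpha along the whole sequence and therefore also in the limit.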

We  can  now  define  the  announced  equilibrium  refinements. 

Definition 14.14 An assessment $(b, \beta)$ in $\Gamma = (S, (u_i)_{i \in N})$ is a perfect Bayesian equilibrium if it is sequentially rational and Bayesian consistent; it is a sequential equilibrium if it is sequentially rational and consistent. □

The  following  theorem,  stated  without  a  proof,  collects  some  facts  about  perfect 
Bayesian  and  sequential  equilibrium. 

Fig. 14.7 Under consistency, the beliefs of players 2 and 3 are identical: $\beta(x_1) = \beta(x_3)$

1 See also the Notes to this chapter.

Fig. 14.8 The game of Example 14.16


Theorem  14.15  Every  extensive  form  game  has  a  sequential  equilibrium.  Every 
sequential  equilibrium  is  perfect  Bayesian.  Every  perfect  Bayesian  equilibrium  is 
subgame  perfect  and,  in  particular,  a  Nash  equilibrium. 

The  following  example  shows  that  a  perfect  Bayesian  equilibrium  does  not  have  to 
be  sequential. 

Example 14.16 We consider again the extensive form structure of Fig. 14.7, supplemented with payoffs: see Fig. 14.8. An example of a perfect Bayesian equilibrium in this game is the assessment $(b, \beta)$ with $\beta(x_1) = \beta(x_4) = 1$ and $b_1(a) = b_2(e) = b_3(f) = 1$. This equilibrium is not sequential, since any consistent assessment should attach equal belief probabilities to $x_1$ and $x_3$. In fact, there is no sequential equilibrium in which $b_1(a) = b_2(e) = b_3(f) = 1$. To see this, let $\alpha = \beta(x_1) = \beta(x_3)$. Then still $b_1(a) = 1$. If $\alpha < 1/2$, then $b_3(f) = 1$, if $\alpha > 1/2$, then $b_3(f) = 0$, and if $\alpha = 1/2$, then $b_3(f)$ is arbitrary. For player 2, playing $d$ yields 2. Playing $e$ yields $3\alpha$ if $\alpha < 1/2$, 0 if $\alpha > 1/2$, and at most $3/2$ if $\alpha = 1/2$. Hence, $b_2(d) = 1$. Hence, the sequential equilibria in this game are the assessments $(b, \beta)$ with $b_1(a) = b_2(d) = 1$, $\beta(x_1) = \beta(x_3) = \alpha \in [0, 1]$, and if $\alpha < 1/2$, then $b_3(f) = 1$, if $\alpha > 1/2$, then $b_3(f) = 0$, and if $\alpha = 1/2$, then $b_3(f)$ is arbitrary. For $0 < \alpha < 1$, such an assessment is obtained as the limit of assessments $(b^m, \beta^m)$ with: $b_1^m(a) = 1 - \alpha/m - (1 - \alpha)/m$, $b_1^m(b) = \alpha/m$, and $b_1^m(c) = (1 - \alpha)/m$; $b_2^m(e) = 1/m$; and $b_3^m(g) = 1/m$ if $\alpha < 1/2$, $b_3^m(f) = 1/m$ if $\alpha > 1/2$, and $b_3^m(f) = b_3(f)$ if $b_3(f) \in (0, 1)$. If $\alpha = 1$, then $b_1^m(a) = 1 - 1/m - 1/m^2$, $b_1^m(b) = 1/m$, $b_1^m(c) = 1/m^2$; if $\alpha = 0$, then $b_1^m(a) = 1 - 1/m - 1/m^2$, $b_1^m(b) = 1/m^2$, $b_1^m(c) = 1/m$; in both these cases the behavioral strategies $b_2^m$ and $b_3^m$ can remain unchanged. The belief systems $\beta^m$ are determined by Bayesian consistency. □

Fig. 14.9 Sequential equilibrium analysis in a three-player game


The  advantage  of  perfect  Bayesian  equilibrium  is  that  it  avoids  the  condition 
involving  limiting  assessments  in  the  definition  of  consistency.  The  idea  behind 
perfect  Bayesian  equilibrium  is  that  beliefs  of  players  should  be  obtained  by  the 
method  of  Bayesian  updating  ‘as  much  as  possible’.  In  the  preceding  examples, 
however,  it  has  become  clear  that  the  formal  definition  of  Bayesian  consistency 
and  perfect  Bayesian  equilibrium  does  not  capture  this  completely.  (See  also  the 
Notes  to  this  chapter.)  Therefore,  we  stick  to  sequential  equilibrium  but  keep  in 
mind  that,  in  order  to  compute  sequential  equilibria  for  a  given  game,  the  method 
of  Bayesian  updating  ‘as  much  as  possible’  is  a  good  heuristic.  There  is  hardly  any 
general  method  available  to  compute  sequential  equilibria:  it  depends  very  much  on 
the  game  at  hand  what  the  best  way  is.  We  conclude  with  two  additional  examples 
and  refer  to  the  problem  section  for  other  examples. 

Example 14.17 Consider the three-player game in Fig. 14.9. The extensive form structure is similar to the one in Example 14.16, but without the ‘outside option’ of player 1. Note that the only subgame is the whole game; the notation $\mathbb{P}_b(\cdot)$ refers to the probabilities in this trivial subgame. To find the sequential equilibria of this game, first observe that consistency requires the beliefs of players 2 and 3 to be the same, so $\beta = \beta'$. For completeness we spell out the argument. Denote the behavioral strategies of the players by $b_1(a)$ and $b_1(b) = 1 - b_1(a)$, $b_2(c)$ and $b_2(d) = 1 - b_2(c)$, and $b_3(e)$ and $b_3(f) = 1 - b_3(e)$. Suppose $b_1$, $b_2$, and $b_3$ are completely mixed. Let $x$ denote the left node in player 2's information set $h_2$. Then $\mathbb{P}_b(x) = b_1(a)$ and $\mathbb{P}_b(h_2) = 1$, so by Bayesian consistency $\beta = b_1(a)$. Let $y$ denote the left node in player 3's information set $h_3$. Then $\mathbb{P}_b(y) = b_1(a) b_2(c)$ and $\mathbb{P}_b(h_3) = b_2(c)$, so by Bayesian consistency $\beta' = b_1(a) b_2(c) / b_2(c) = b_1(a)$. Thus, $\beta = \beta'$ if the behavioral strategies are completely mixed. By consistency, $\beta = \beta'$ for all profiles of behavioral strategies. Note that this is not necessarily the case under only Bayesian consistency: if player 2 plays $d$ with probability 1, then $\beta'$ is ‘free’.

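Now, starting with player 3, sequential rationality requires $b_3(e) = 1$ if $\beta > \frac{1}{4}$, $b_3(e) = 0$ if $\beta < \frac{1}{4}$, and $0 \leq b_3(e) \leq 1$ if $\beta = \frac{1}{4}$.

Using this, if $\beta > \frac{1}{4}$ then playing $c$ yields 0 for player 2 and playing $d$ yields 1. Therefore $b_2(c) = 0$. Similarly, $b_2(c) = 1$ if $\beta < \frac{1}{4}$. If $\beta = \frac{1}{4}$ then $d$ yields 1 whereas $c$ yields $3 b_3(f)$. Hence, $b_2(c) = 0$ if $\beta = \frac{1}{4}$ and $b_3(f) < \frac{1}{3}$; $b_2(c) = 1$ if $\beta = \frac{1}{4}$ and $b_3(f) > \frac{1}{3}$; and $0 \leq b_2(c) \leq 1$ if $\beta = \frac{1}{4}$ and $b_3(f) = \frac{1}{3}$.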

We finally consider player 1. If $b_1(a) > \frac{1}{4}$ then consistency requires $\beta = b_1(a) > \frac{1}{4}$ and therefore $b_2(c) = 0$. So player 1 obtains $2 b_1(a) + 1(1 - b_1(a)) = 1 + b_1(a)$, which is maximal for $b_1(a) = 1$. Obviously, player 1 cannot improve on this. So we have the following sequential equilibrium:

$$b_1(a) = 1, \quad b_2(c) = 0, \quad b_3(e) = 1, \quad \beta = \beta' = 1.$$

If $b_1(a) < \frac{1}{4}$ then $\beta = b_1(a) < \frac{1}{4}$ and therefore $b_2(c) = 1$ and $b_3(e) = 0$. So player 1 obtains $1 b_1(a) + 2(1 - b_1(a)) = 2 - b_1(a)$, which is maximal for $b_1(a) = 0$. Obviously again, player 1 cannot improve on this. So we have a second sequential equilibrium:

$$b_1(a) = 0, \quad b_2(c) = 1, \quad b_3(e) = 0, \quad \beta = \beta' = 0.$$


If $b_1(a) = \frac{1}{4}$ then player 1 must be indifferent between $a$ and $b$. This implies that the expected payoff from $a$ should be equal to the expected payoff from $b$, hence that $2(1 - b_2(c)) + 1 b_2(c) = 2 b_2(c) + 1(1 - b_2(c))$, which is true for $b_2(c) = \frac{1}{2}$. The preceding analysis for player 2 shows that for player 2 to play completely mixed we need $b_3(e) = \frac{2}{3}$. So we have a third sequential equilibrium

$$b_1(a) = \tfrac{1}{4}, \quad b_2(c) = \tfrac{1}{2}, \quad b_3(e) = \tfrac{2}{3}, \quad \beta = \beta' = \tfrac{1}{4}.$$

Also in this game there are perfect Bayesian equilibria which are not sequential, e.g., $b_1(a) = b_2(d) = 1$, $b_3(e) = 2/3$, $\beta = 1$, and $\beta' = 1/4$. □
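The mixed equilibrium of this example can be recovered by solving the two indifference conditions used in the analysis. The sketch below does so with exact fractions. The payoff expressions are the ones stated in the example, while the helper `solve_affine` and the variable names are assumptions of this sketch.

```python
from fractions import Fraction

def solve_affine(f, g):
    """Solve f(t) = g(t) for t, assuming both sides are affine in t
    and the solution is unique."""
    h0 = f(Fraction(0)) - g(Fraction(0))
    h1 = f(Fraction(1)) - g(Fraction(1))
    return h0 / (h0 - h1)

# Player 1 is indifferent between a and b; q stands for b2(c).
q = solve_affine(lambda q: 2 * (1 - q) + 1 * q,
                 lambda q: 2 * q + 1 * (1 - q))

# Player 2 is indifferent between c (payoff 3*b3(f)) and d (payoff 1);
# r stands for b3(f).
r = solve_affine(lambda r: 3 * r, lambda r: Fraction(1))

b2_c, b3_e = q, 1 - r
```

Because both conditions are affine in a single unknown, evaluating each side at 0 and 1 is enough to locate the solution.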

Fig. 14.10 The game of Example 14.18

Example 14.18 Consider the signaling game in Fig. 14.10 (this is the same game as in Fig. 5.4). To find the sequential equilibria in this game, we start with the beliefs of player 2. Clearly, and independently of $\alpha$, sequential rationality requires player 2 to play $u$ at his left information set. At his right information set, $u'$ is at least as good as $d'$ if $2\beta \geq 4(1 - \beta)$, hence if $\beta \geq 2/3$. Therefore, we distinguish three cases:

• $\beta > 2/3$. Then player 2 plays the pure strategy $uu'$, hence player 1 plays $RL'$, implying $\beta = 1$ and $\alpha = 0$. This is a sequential equilibrium (in fact, a pure strategy Nash equilibrium). In behavioral strategies: $b_1(R) = b_1(L') = 1$, $b_2(u) = b_2(u') = 1$.

• $\beta < 2/3$. Then player 2 plays $ud'$, player 1 plays $LL'$, $\alpha = 1/2$, and $\beta < 2/3$. In behavioral strategies: $b_1(L) = b_1(L') = 1$, $b_2(u) = b_2(d') = 1$.

• $\beta = 2/3$. Again $b_2(u) = 1$ implies $b_1(L') = 1$, but then we must have $b_1(L) = 1$ since otherwise we would have $\beta = 1$ (by Bayesian consistency). We then must have $b_2(u') \leq 1/2$ to keep type $t$ from deviating to $R$. Further, $\alpha = 1/2$ completes the description of this collection of sequential equilibria.

The  analysis  of  this  game  is  facilitated  by  the  fact  that  player  2  always  prefers  u  over 
d.  In  general,  analysis  of  even  these  relatively  simple  signaling  games  can  be  quite 
cumbersome.  □ 
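Player 2's choice at the right information set, which drives the case distinction in this example, can be summarized in a few lines. The sketch uses only the expected payoffs $2\beta$ for $u'$ and $4(1 - \beta)$ for $d'$ stated in the example; the function name is hypothetical.

```python
from fractions import Fraction

def best_replies_right(beta):
    """Actions of player 2 that are optimal at the right information
    set, given belief beta (expected payoffs as in the example)."""
    payoff_u, payoff_d = 2 * beta, 4 * (1 - beta)
    if payoff_u > payoff_d:
        return {"u'"}
    if payoff_u < payoff_d:
        return {"d'"}
    return {"u'", "d'"}  # indifferent exactly at beta = 2/3
```

Using exact fractions avoids rounding trouble at the knife-edge belief.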

Remark  14.19  With  the  exception  of  Example  14.16  we  did  not  really  check  that  the 
sequential  equilibria,  computed  in  the  examples,  are  indeed  consistent.  In  general, 
however,  this  is  rather  straightforward.  For  instance,  for  the  sequential  equilibria  in 
Example  14.17  we  can  proceed  as  follows,  for  each  of  the  three  equilibria: 

• $b_1(a) = 1$, $b_2(c) = 0$, $b_3(e) = 1$, $\beta = \beta' = 1$. For each $m \in \mathbb{N}$ take $b_1^m(b) = b_2^m(c) = b_3^m(f) = 1/m$. The associated beliefs are determined by Bayesian consistency.

• $b_1(a) = 0$, $b_2(c) = 1$, $b_3(e) = 0$, $\beta = \beta' = 0$. For each $m \in \mathbb{N}$ take $b_1^m(a) = b_2^m(d) = b_3^m(e) = 1/m$. The associated beliefs are determined by Bayesian consistency.

• $b_1(a) = 1/4$, $b_2(c) = 1/2$, $b_3(e) = 2/3$, $\beta = \beta' = 1/4$. For each $m \in \mathbb{N}$ take $b_1^m(a) = 1/4$, $b_2^m(c) = 1/2$, $b_3^m(e) = 2/3$ (hence, constant). The associated beliefs are determined by Bayesian consistency.


□ 


14.4  Problems 

14.1.  Mixed  and  Behavioral  Strategies 

Determine  all  mixed  strategies  that  are  outcome  equivalent  with  the  behavioral 
strategy  represented  in  the  following  one-player  extensive  form  structure. 


14.2.  An  Extensive  Form  Structure  Without  Perfect  Recall 
Consider  the  following  extensive  form  structure: 


(a)  Show  that  this  one-player  extensive  form  structure  has  no  perfect  recall. 

(b) Consider the mixed strategy $\sigma_1$ that assigns probability $1/2$ to both $(L, l)$ and $(R, r)$. Show that there is no behavioral strategy that generates the same probability distribution over the end nodes as $\sigma_1$ does.

14.3.  Consistency  Implies  Bayesian  Consistency 

Let $(b, \beta)$ be a consistent assessment in an extensive form structure $S$. Show that $(b, \beta)$ is Bayesian consistent.

14.4. (Bayesian) Consistency in Signaling Games

Prove that Bayesian consistency implies consistency in a signaling game.

[A general definition of a signaling game (cf. also Chap. 5) is as follows. The set of players is $N = \{1, 2\}$ and for the extensive form structure $S$ we have:

1. The directed rooted tree is $T = (X, E)$ with root $x_0$,

$$X = \{x_0, x_1, \ldots, x_k, x_{11}, \ldots, x_{1l}, \ldots, x_{k1}, \ldots, x_{kl}, x_{111}, \ldots, x_{klm}\},$$

where $k, l, m \geq 2$,

$$E = \{(x_0, x_i), (x_i, x_{ij}), (x_{ij}, x_{ijj'}) \mid i = 1, \ldots, k, \ j = 1, \ldots, l, \ j' = 1, \ldots, m\}.$$

Hence

$$Z = \{x_{ijj'} \mid i = 1, \ldots, k, \ j = 1, \ldots, l, \ j' = 1, \ldots, m\}.$$

2. The chance and player assignment $P$ is defined by $P(x_0) = C$, $P(x_i) = 1$ for all $i = 1, \ldots, k$, $P(x_{ij}) = 2$ for all $i = 1, \ldots, k$, $j = 1, \ldots, l$.

3. The information sets are

$$H_1 = \{\{x_1\}, \ldots, \{x_k\}\}, \quad H_2 = \{\{x_{1j}, \ldots, x_{kj}\} \mid j = 1, \ldots, l\}.$$

4. The action sets are

$$A(\{x_i\}) = \{\{(x_i, x_{ij})\} \mid j = 1, \ldots, l\} \quad \text{for every } i = 1, \ldots, k$$

for player 1 and

$$A(\{x_{1j}, \ldots, x_{kj}\}) = \{\{(x_{1j}, x_{1jj'}), \ldots, (x_{kj}, x_{kjj'})\} \mid j' = 1, \ldots, m\}$$

for every $j = 1, \ldots, l$, for player 2.

5. The map $\tau$ assigns a positive probability to each edge in the set $\{(x_0, x_1), \ldots, (x_0, x_k)\}$. (Player 1 has $k$ ‘types’.)

Finally, the players have payoff functions $u_1, u_2 : Z \to \mathbb{R}$.]

14.5.  Sequential  Equilibria  in  a  Signaling  Game 

Compute  the  sequential  equilibria  in  the  signaling  game  in  Fig.  14.11  (this  is  the 
game  from  Fig.  5.8). 

14.6.  Computation  of  Sequential  Equilibrium  (1) 

Compute the sequential equilibrium or equilibria in the game $\Gamma_3$ in Fig. 14.5.

14.7.  Computation  of  Sequential  Equilibrium  (2) 

Consider the following extensive form game.

(a) Determine the strategic form of this game.

(b) Determine all Nash and subgame perfect Nash equilibria of this game.

(c) Determine all sequential equilibria of this game.

Fig. 14.11 The signaling game of Problem 14.5


14.8.  Computation  of  Sequential  Equilibrium  (3) 

Consider the following extensive form game.

(a)  Determine  the  strategic  form  of  this  game. 

(b)  Compute  the  Nash  equilibria  and  subgame  perfect  Nash  equilibria. 

(c) Compute the sequential equilibria.

14.9. Computation of Sequential Equilibrium (4)

Consider the following extensive form game.


Compute  all  sequential  equilibria  of  this  game. 

14.10.  Computation  of  Sequential  Equilibrium  (5) 

Compute  all  Nash,  subgame  perfect  and  sequential  equilibria  in  the  following  game. 



14.5  Notes 

For  more  about  refinements  in  extensive  form  games  and  some  relations  with 
refinements  of  Nash  equilibrium  in  strategic  form  games  see  van  Damme  (1991) 
and  Perea  (2001).  The  presentation  in  Sect.  14.1  is  based  on  the  latter  source.  For 
most  of  the  remarks  below  Perea  (2001)  is  a  good  reference. 

The  condition  of  perfect  recall  was  introduced  by  Kuhn  (1953).  For  a  proof  of 
Theorem  14.6  see  that  paper,  or  Perea  (2001),  Theorem  2.4.4. 

In  Example  14.8,  the  strategies  BC  and  BD  are  equivalent  for  all  practical 
purposes.  Indeed,  some  authors  do  not  distinguish  between  these  strategies.  More 
generally,  actions  of  players  excluded  by  own  earlier  actions  could  be  left  out,  but 
we  do  not  do  this  in  this  book. 

By  carrying  out  a  backward  induction  procedure  as  in  Sect.  4.3,  it  can  be  seen  that 
a  subgame  perfect  equilibrium  exists  in  any  extensive  form  game.  This  is  intuitive 
but  not  trivial.  See,  e.g.,  Perea  (2001). 

In the literature the condition of updating consistency has been proposed to remedy the defect in a game like the one in Fig. 14.6: with player 1 playing $a$ and player 2 playing $e$, updating consistency would imply that $\beta(x_3) = \beta(x_1)$, as seems natural. Consistency of beliefs based on limits of completely mixed strategies was introduced in Kreps and Wilson (1982). This consistency condition is stronger than Bayesian consistency combined with ‘updating consistency’, since the latter condition would only require a player to ‘update’ his own earlier beliefs. Also sequential equilibria were introduced in Kreps and Wilson (1982). The definition of sequential rationality in Definition 14.11 is actually called ‘local’ sequential rationality. Together with consistency, it implies the sequential rationality condition in Kreps and Wilson. It is, however, easier to apply. See also Perea (2001), Chap. 4, or van Damme (1991), Chap. 6. See the same sources for a proof that sequential equilibria always exist.

There is no unified definition of perfect Bayesian equilibrium in the literature, since it is hard to capture all instances of Bayesian updating for general extensive form games. For instance, Fudenberg and Tirole (1991b) provide two definitions of perfect Bayesian equilibrium for different classes of games and show that their concepts coincide with sequential equilibrium.

In this chapter we go deeper into evolutionary game theory. The concepts of evolutionary stable strategy and replicator dynamics, introduced in Chap. 8, are further explored. It may be helpful to study Chap. 8 first, although the present chapter is largely self-contained.

In  Sect.  15.1  we  briefly  review  symmetric  two-player  games.  Section  15.2 
discusses  evolutionary  stable  strategies  and  Sect.  15.3  replicator  dynamics. 


15.1 Symmetric Two-Player Games

Much of evolutionary game theory is concerned with symmetric two-player games. A (finite) symmetric two-player game is a pair of $m \times m$ payoff matrices $(A, B)$ such that $B = A^T$, i.e., $B = (b_{ij})_{i,j=1}^m$ is the transpose of $A = (a_{ij})_{i,j=1}^m$. In other words, for all $i, j \in \{1, 2, \ldots, m\}$, we have $b_{ij} = a_{ji}$.

In such a game we are particularly interested in symmetric (pure and mixed strategy) Nash equilibria. A Nash equilibrium $(\sigma_1, \sigma_2)$ is symmetric if $\sigma_1 = \sigma_2$. We denote by $NE(A, A^T)$ the set of all Nash equilibria of $(A, A^T)$ and by

$$NE(A) = \{x \in \Delta^m \mid (x, x) \in NE(A, A^T)\}$$

the set of all strategies that occur in a symmetric Nash equilibrium. By a standard application of the Kakutani Fixed Point Theorem we prove that this set is nonempty.

Proposition 15.1 For any $m \times m$-matrix $A$, $NE(A) \neq \emptyset$.

Proof For each $x \in \Delta^m$, viewed as a strategy of player 2 in $(A, A^T)$, let $\beta_1(x)$ be the set of best replies of player 1 in $(A, A^T)$. Then the correspondence $x \mapsto \beta_1(x)$ is upper semi-continuous and convex-valued (check this), so that by the Kakutani Fixed Point Theorem 22.11 there is an $x^* \in \Delta^m$ with $x^* \in \beta_1(x^*)$. Since player 2's payoff matrix is the transpose of $A$, it also follows that $x^* \in \beta_2(x^*)$. Hence, $(x^*, x^*) \in NE(A, A^T)$, so $x^* \in NE(A)$. ■


15.1.1 Symmetric 2 × 2-Games

For later reference it is convenient to have a classification of symmetric 2 × 2-games with respect to their symmetric Nash equilibria. Such a game is described by the payoff matrix

$$A = \begin{pmatrix} a_{11} & a_{12} \\ a_{21} & a_{22} \end{pmatrix}.$$

For the purpose of Nash equilibrium analysis, we may consider without loss of generality the matrix

$$A' = \begin{pmatrix} a_{11} - a_{21} & a_{12} - a_{12} \\ a_{21} - a_{21} & a_{22} - a_{12} \end{pmatrix} = \begin{pmatrix} a_{11} - a_{21} & 0 \\ 0 & a_{22} - a_{12} \end{pmatrix} = \begin{pmatrix} a_1 & 0 \\ 0 & a_2 \end{pmatrix},$$

where $a_1 := a_{11} - a_{21}$ and $a_2 := a_{22} - a_{12}$. Indeed, it is straightforward to verify that $(p, q) \in \Delta^2 \times \Delta^2$ is a Nash equilibrium of $(A, A^T)$ if and only if it is a Nash equilibrium of $(A', A'^T)$.

For a generic matrix $A$, implying $a_1, a_2 \neq 0$, there are essentially three different cases:

(1) $a_1 < 0$, $a_2 > 0$. In this case, $NE(A') = \{e^2\}$, i.e., each player playing the second strategy is the unique symmetric Nash equilibrium.

(2) $a_1, a_2 > 0$. In this case, $NE(A') = \{e^1, e^2, x\}$, where $x = (a_2/(a_1 + a_2), a_1/(a_1 + a_2))$.

(3) $a_1, a_2 < 0$. In this case, $NE(A') = \{x\}$ with $x$ as in (2).
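The case distinction above translates directly into a small routine. The following Python sketch is one way to write it; the function name and the example matrices used afterwards are hypothetical, and the mirror image of case (1), with $a_1 > 0$ and $a_2 < 0$, is handled by symmetry.

```python
from fractions import Fraction

def symmetric_equilibria_2x2(A):
    """Strategies in NE(A) for a generic symmetric 2x2 game, following
    the case distinction on a1 = a11 - a21 and a2 = a22 - a12."""
    a1 = Fraction(A[0][0] - A[1][0])
    a2 = Fraction(A[1][1] - A[0][1])
    if a1 == 0 or a2 == 0:
        raise ValueError("non-generic game: a1 and a2 must be nonzero")
    e1, e2 = (Fraction(1), Fraction(0)), (Fraction(0), Fraction(1))
    # The mixed strategy only exists when a1 and a2 have the same sign.
    mixed = (a2 / (a1 + a2), a1 / (a1 + a2)) if a1 * a2 > 0 else None
    if a1 > 0 and a2 > 0:
        return [e1, e2, mixed]        # case (2)
    if a1 < 0 and a2 < 0:
        return [mixed]                # case (3)
    return [e2] if a1 < 0 else [e1]   # case (1) and its mirror image
```

For instance, a prisoners' dilemma such as `[[3, 0], [5, 1]]` falls under case (1), while a coordination game such as `[[2, 0], [0, 1]]` falls under case (2).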


15.2 Evolutionary Stability
15.2.1 Evolutionary Stable Strategies

In evolutionary game theory the interpretation of a symmetric two-person game is that players in a possibly large population randomly meet in pairs. Let such a game be described by $A$; then a mixed strategy $x \in \Delta^m$ is interpreted as a vector of population shares: for each $k$, $x_k$ is the share of the population that ‘plays’ pure strategy $k$. Such a strategy is called evolutionary stable if it performs better against a small ‘mutation’ than that mutation performs against itself. Formally, we have the following definition.

Definition 15.2 A strategy $x \in \Delta^m$ is an evolutionary stable strategy (ESS) in $A$ if for every strategy $y \in \Delta^m$, $y \neq x$, there exists some $0 < \varepsilon_y < 1$ such that for all $0 < \varepsilon < \varepsilon_y$ we have

$$xA(\varepsilon y + (1 - \varepsilon)x) > yA(\varepsilon y + (1 - \varepsilon)x). \qquad (15.1)$$

The set of all ESS is denoted by $ESS(A)$. □

Again, the interpretation of an ESS $x$ is as follows. Consider any small mutation $\varepsilon y + (1 - \varepsilon)x$ of $x$. Condition (15.1) then says that against such a small mutation, the original strategy $x$ is better than the mutant strategy $y$. In other words, if the population $x$ is invaded by a small part of the mutant population $y$, then $x$ survives since it fares better against this small mutation than the mutant $y$ itself does.

Evolutionary  stable  strategies  can  be  characterized  as  follows.  In  fact,  this 
characterization  was  used  as  the  definition  of  an  evolutionary  stable  strategy  in 
Chap.  8  (Definition  8.4). 

Theorem 15.3 Let $A$ be a symmetric $m \times m$ game. Then

$$ESS(A) = \{x \in NE(A) \mid \forall y \in \Delta^m, \ y \neq x \ [xAx = yAx \Rightarrow xAy > yAy]\}.$$

This  theorem  follows  from  Propositions  15.4  and  15.5. 
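The characterization in Theorem 15.3 suggests a simple numerical screen. The Python sketch below tests the two conditions against a finite list of mutants only, so passing the screen is necessary but not sufficient for being an ESS. The function names, the grid, and both payoff matrices are assumptions of this sketch.

```python
from fractions import Fraction

def payoff(p, A, q):
    """Bilinear payoff pAq."""
    return sum(p[i] * A[i][j] * q[j]
               for i in range(len(p)) for j in range(len(q)))

def passes_ess_screen(A, x, mutants):
    """Check the conditions of Theorem 15.3 for x against a finite
    list of mutants only; passing is necessary, not sufficient."""
    m = len(x)
    xAx = payoff(x, A, x)
    pure = [tuple(Fraction(int(i == k)) for i in range(m)) for k in range(m)]
    if any(payoff(e, A, x) > xAx for e in pure):
        return False  # x is not a symmetric Nash equilibrium strategy
    for y in mutants:
        if y != x and payoff(y, A, x) == xAx:
            if not payoff(x, A, y) > payoff(y, A, y):
                return False
    return True

grid = [(Fraction(t, 20), 1 - Fraction(t, 20)) for t in range(21)]
hawk_dove_like = [[0, 3], [1, 2]]   # hypothetical payoffs
coordination = [[2, 0], [0, 1]]     # hypothetical payoffs
```

With these matrices, the mixed strategy $(1/2, 1/2)$ passes the screen in the first game, whereas the mixed equilibrium strategy $(1/3, 2/3)$ of the coordination game fails it.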

Proposition 15.4 Let $A$ be an $m \times m$-matrix and let $x \in ESS(A)$. Then $x \in NE(A)$.

Proof Let $y \in \Delta^m$, then it is sufficient to show $xAx \geq yAx$. Let $\varepsilon_y$ be as in Definition 15.2, then

$$xA(\varepsilon y + (1 - \varepsilon)x) > yA(\varepsilon y + (1 - \varepsilon)x)$$

for all $0 < \varepsilon < \varepsilon_y$ by (15.1). By letting $\varepsilon$ go to zero, this implies $xAx \geq yAx$. ■

Proposition 15.5 Let $A$ be an $m \times m$-matrix. If $x \in ESS(A)$, then, for all $y \in \Delta^m$ with $y \neq x$ we have:

$$xAx = yAx \Rightarrow xAy > yAy. \qquad (15.2)$$

Conversely, if $x \in NE(A)$ and (15.2) holds, then $x \in ESS(A)$.

Proof Let $x \in ESS(A)$. Let $y \in \Delta^m$ with $y \neq x$ and $xAx = yAx$. Suppose that $yAy \geq xAy$. Then, for any $\varepsilon \in [0, 1]$, $yA(\varepsilon y + (1 - \varepsilon)x) \geq xA(\varepsilon y + (1 - \varepsilon)x)$, contradicting (15.1).

Conversely, let $x \in NE(A)$ and let (15.2) hold for $x$. If $xAx > yAx$, then also $xA(\varepsilon y + (1 - \varepsilon)x) > yA(\varepsilon y + (1 - \varepsilon)x)$ for small enough $\varepsilon$. If $xAx = yAx$, then $xAy > yAy$, hence (15.1) holds for any $\varepsilon \in (0, 1]$. ■

Clearly, Theorem 15.3 follows from the preceding propositions.

From Problems 15.1-15.3 it follows that the concept of ESS does not ‘solve’ the prisoners’ dilemma nor the ‘coordination problem’. Also, an ESS may be completely mixed (Hawk-Dove), or fail to exist (Rock-Paper-Scissors).


15.2.2 The Structure of the Set ESS(A)

Let $x$ be an ESS and let $y \in \Delta^m$, $y \neq x$ such that the carrier of $y$ is contained in the carrier of $x$, i.e., $C(y) \subseteq C(x)$. Since $x \in NE(A)$ by Theorem 15.3, this implies $xAx = yAx$ and hence, again by Theorem 15.3, $xAy > yAy$. Thus $y$ is not a best reply against itself. We have established:

Proposition 15.6 If $x \in ESS(A)$ and $y \in \Delta^m$ with $y \neq x$ and $C(y) \subseteq C(x)$, then $y \notin NE(A)$.

This implies the following corollary (check!):

Corollary 15.7 The set $ESS(A)$ is finite. If $x \in ESS(A)$ is completely mixed, then $ESS(A) = \{x\}$.


15.2.3 Relations with Other Refinements

If $x \in NE(A)$ is weakly dominated by $y \in \Delta^m$, then $xAx = yAx$ and $yAy \geq xAy$; so by Theorem 15.3, $x \notin ESS(A)$. Therefore, if $x \in ESS(A)$, then $(x, x)$ is an undominated equilibrium and hence perfect by Theorem 13.23. It can even be shown that $(x, x)$ is proper. The next proposition summarizes these facts.

Proposition 15.8 If $x \in ESS(A)$, then $(x, x) \in NE(A, A^T)$ is undominated, perfect and proper.

The unique (symmetric) equilibrium in the Rock-Paper-Scissors game in Problem 15.3 is proper (why?), but the associated equilibrium strategy is not ESS, so the converse of Proposition 15.8 does not hold.


Recall (see Chap. 13) that the carrier of $y$, $C(y)$, is the set $\{i \in \{1, \ldots, m\} \mid y_i > 0\}$.

15.2.4 Other Characterizations of ESS
15.2.4.1 Uniform Invasion Barriers

The number $\varepsilon_y$ in the definition of an ESS can be interpreted as an ‘invasion barrier’: if the share of the mutant strategy $y$ is smaller than $\varepsilon_y$, then the ‘incumbent’ strategy $x$ fares better against the mutated population than the mutant $y$ itself does, so that the mutant strategy becomes extinct. In a large but finite population, it would not make sense if this invasion barrier could become arbitrarily small since then the ‘mutant’ population would sometimes have to consist of less than one individual to guarantee survival of the strategy $x$ under consideration. This gives rise to the following definition.

Definition 15.9 A strategy $x \in \Delta^m$ has a uniform invasion barrier if there exists an $\bar{\varepsilon} \in (0, 1)$ such that (15.1) holds for all strategies $y \neq x$ and every $\varepsilon \in (0, \bar{\varepsilon})$. □

It turns out that possessing a uniform invasion barrier characterizes an evolutionary stable strategy.

Proposition 15.10 For each $x \in \Delta^m$, $x \in ESS(A)$ if and only if $x$ has a uniform invasion barrier.

Proof Let $x \in \Delta^m$. If $x$ has a uniform invasion barrier $\bar{\varepsilon}$, then clearly $x$ is an ESS by choosing, in (15.1), $\varepsilon_y = \bar{\varepsilon}$ for each $y \in \Delta^m$.

Conversely, let $x$ be an ESS. Define the function $b : \Delta^m \setminus \{x\} \to [0, 1]$ by

$$b(y) = \sup\{\delta \in [0, 1] \mid \forall \varepsilon \in (0, \delta) \ [(x - y)A(\varepsilon y + (1 - \varepsilon)x) > 0]\}$$

for all $y \in \Delta^m \setminus \{x\}$. We first consider the function $b$ on the compact set $Z = \{z \in \Delta^m \mid z_i = 0 \text{ for some } i \in C(x)\}$. Consider $y \in Z$. Since $x$ is an ESS, we have that $(x - y)A(\varepsilon y + (1 - \varepsilon)x)$ is positive for small positive values of $\varepsilon$. Since this expression depends linearly on $\varepsilon$, this implies that there can be at most one value of $\varepsilon$, which we denote by $\varepsilon_y$, such that $(x - y)A(\varepsilon_y y + (1 - \varepsilon_y)x) = 0$. If $\varepsilon_y \in (0, 1)$, then $(x - y)A(\varepsilon_y y + (1 - \varepsilon_y)x) = 0$ implies that $(x - y)A(x - y) \neq 0$. To see this, suppose that $(x - y)A(x - y) = 0$. Then

$$0 = (x - y)A(\varepsilon_y y + (1 - \varepsilon_y)x) = (x - y)Ax,$$

hence also $(x - y)Ay = 0$ and, thus, $(x - y)A(\varepsilon y + (1 - \varepsilon)x) = 0$ for all $\varepsilon$, a contradiction. Hence, in that case, $b(y) = \varepsilon_y = (x - y)Ax / (x - y)A(x - y)$; otherwise, $b(y) = 1$. Clearly, $b$ is a continuous function. Since $b$ is positive and $Z$ is compact, $\min_{y \in Z} b(y) > 0$. Hence, $x$ has a uniform invasion barrier, namely this minimum value, on the set $Z$.

Now suppose that $y \in \Delta^m$, $y \neq x$. We claim that there is a $\lambda \in (0, 1]$ such that $y = \lambda z + (1 - \lambda)x$ for some $z \in Z$. To see this, first note that we can take $\lambda = 1$ and $z = y$ if $y \in Z$. If $y \notin Z$ then consider, for each $\mu \geq 0$, the point $z(\mu) = (1 - \mu)x + \mu y$, and let $\bar{\mu} \geq 1$ be the largest value of $\mu$ such that $z(\mu) \in \Delta^m$. Then there is a coordinate $i \in \{1, \ldots, m\}$ with $z_i(\bar{\mu}) = 0$, $z_i(\mu) > 0$ for all $\mu < \bar{\mu}$, and $z_i(\mu) < 0$ for all $\mu > \bar{\mu}$. Clearly, this implies $x_i > y_i$, hence $i \in C(x)$, and thus $z(\bar{\mu}) \in Z$. Then, for $z = z(\bar{\mu})$ and $\lambda = 1/\bar{\mu}$, we have $y = \lambda z + (1 - \lambda)x$.

By straightforward computation we have

$$(x - y)A(\varepsilon y + (1 - \varepsilon)x) = \lambda (x - z)A(\varepsilon \lambda z + (1 - \varepsilon \lambda)x)$$

for each $\varepsilon > 0$, so that $b(y) = \min\{b(z)/\lambda, 1\} \geq b(z)$.

We conclude that $\bar{\varepsilon} = \min_{y \in Z} b(y)$ is a uniform invasion barrier for $x$. ■
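The barrier $b(y)$ used in this proof has a closed form, because the expression $(x - y)A(\varepsilon y + (1 - \varepsilon)x)$ equals $(x - y)Ax - \varepsilon (x - y)A(x - y)$. The Python sketch below evaluates it for a given incumbent and mutant. The function names and the handling of mutants that are not repelled at all (barrier 0) are assumptions of this sketch; the proof itself only needs the case where $x$ is an ESS.

```python
from fractions import Fraction

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def mat_vec(A, v):
    return [dot(row, v) for row in A]

def invasion_barrier(A, x, y):
    """b(y): the largest delta in [0, 1] such that
    (x - y)A(eps*y + (1 - eps)*x) > 0 for all eps in (0, delta)."""
    d = [xi - yi for xi, yi in zip(x, y)]
    c0 = dot(d, mat_vec(A, x))   # (x - y)Ax
    c1 = dot(d, mat_vec(A, d))   # (x - y)A(x - y)
    # The expression equals c0 - eps*c1, which is affine in eps.
    if c0 < 0 or (c0 == 0 and c1 >= 0):
        return Fraction(0)       # y is not repelled for small eps
    if c1 <= 0:
        return Fraction(1)       # positive on all of (0, 1)
    return min(Fraction(1), Fraction(c0) / Fraction(c1))
```

In the hypothetical coordination game `[[2, 0], [0, 1]]`, the incumbent `(1, 0)` resists the mutant `(0, 1)` up to a mutant share of 2/3, which agrees with comparing the payoffs $2(1 - \varepsilon)$ and $\varepsilon$ directly.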

15.2.4.2  Local  Superiority 

By  Theorem  15.3,  a  completely  mixed  ESS  earns  a  higher  payoff  against  any  mutant 
than  such  a  mutant  earns  against  itself.  This  global  superiority  property  can  be 
generalized  to  the  following  local  version. 

Definition  15.11  The  strategy  x  g  Am  is  locally  superior  if  it  has  an  open 
neighborhood  U  such  that  xAy  >  yAy  for  all  y  G  U\  {x}.  □ 

The  local  superiority  condition  provides  another  characterization  of  ESS. 

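Proposition 15.12 For each $x \in \Delta^m$, $x \in ESS(A)$ if and only if $x$ is locally superior.

Proof Let $x \in \Delta^m$.

First suppose that $x$ is locally superior, and let $U$ be as in Definition 15.11. Let
$z \in \Delta^m \setminus \{x\}$ and define for each $0 < \varepsilon < 1$ the point $w(\varepsilon)$ by $w(\varepsilon) = \varepsilon z + (1 - \varepsilon)x$.
Then there is $\varepsilon_z > 0$ such that $w(\varepsilon) \in U$, hence $xAw(\varepsilon) > w(\varepsilon)Aw(\varepsilon)$, for all
$\varepsilon \in (0, \varepsilon_z)$. Since $w(\varepsilon)Aw(\varepsilon) = \varepsilon zAw(\varepsilon) + (1 - \varepsilon)xAw(\varepsilon)$, this implies
$xAw(\varepsilon) > zAw(\varepsilon)$ for all $\varepsilon \in (0, \varepsilon_z)$. In particular, we have

\[ \varepsilon xAz + (1 - \varepsilon)xAx > \varepsilon zAz + (1 - \varepsilon)zAx \]

for all $\varepsilon \in (0, \varepsilon_z)$, hence (letting $\varepsilon \to 0$) $xAx \ge zAx$. So $x \in NE(A)$. Suppose now that $zAx = xAx$.
Then, for $\varepsilon \in (0, \varepsilon_z)$,

\[
\begin{aligned}
\varepsilon xAz &= xAw(\varepsilon) - (1 - \varepsilon)xAx \\
&> zAw(\varepsilon) - (1 - \varepsilon)xAx \\
&= \varepsilon zAz + (1 - \varepsilon)zAx - (1 - \varepsilon)xAx \\
&= \varepsilon zAz ,
\end{aligned}
\]

so that $xAz > zAz$ and $x$ is an ESS.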

Conversely, let $x$ be an ESS with uniform invasion barrier (cf. Proposition 15.10)
$\bar\varepsilon \in (0, 1)$, and let $Z$ be as in the proof of Proposition 15.10. Let

\[ V = \{y \in \Delta^m \mid y = \varepsilon z + (1 - \varepsilon)x \text{ for some } z \in Z \text{ and } \varepsilon \in [0, \bar\varepsilon)\}. \]

Since $Z$ is closed and $x \notin Z$, there is an open neighborhood $U$ of $x$ such that
$U \cap \Delta^m \subseteq V$. Suppose that $y \neq x$, $y \in U \cap \Delta^m$. Then $y \in V$, and by Proposition 15.10,
$zAy = zA(\varepsilon z + (1 - \varepsilon)x) < xA(\varepsilon z + (1 - \varepsilon)x) = xAy$, with $z$ as in the definition of
$V$. This implies $yAy = \varepsilon zAy + (1 - \varepsilon)xAy < xAy$. ■
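
The local superiority condition of Definition 15.11 can be probed numerically in the two-strategy case. The sketch below is only a grid check, not a proof; the Hawk-Dove payoffs, the function names, and the neighborhood radius are illustrative assumptions and are not taken from the text.

```python
def payoff(x, A, y):
    """Bilinear payoff xAy of strategy x against strategy y."""
    return sum(x[i] * A[i][j] * y[j] for i in range(len(x)) for j in range(len(y)))


def locally_superior_2x2(x, A, radius=0.2, steps=200):
    """Grid test of xAy > yAy for y = (p, 1 - p) with 0 < |p - x[0]| <= radius.

    A finite grid can only refute local superiority or make it plausible.
    """
    for k in range(-steps, steps + 1):
        if k == 0:
            continue  # skip y = x itself
        p = x[0] + radius * k / steps
        if not 0.0 <= p <= 1.0:
            continue  # y must stay in the simplex
        y = (p, 1.0 - p)
        if not payoff(x, A, y) > payoff(y, A, y):
            return False
    return True


# Hypothetical Hawk-Dove payoffs (an assumption made for this illustration).
HAWK_DOVE = [[-1, 2], [0, 1]]
print(locally_superior_2x2((0.5, 0.5), HAWK_DOVE))  # interior candidate
print(locally_superior_2x2((1.0, 0.0), HAWK_DOVE))  # pure first strategy
```

For these assumed payoffs one finds $xAy - yAy = 2(p - 1/2)^2$ at $x = (1/2, 1/2)$, which is why the grid test succeeds there and fails at the pure strategy.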

15.2.4.3  Local  Strict  Efficiency 

Consider the special case of a symmetric game $(A, B)$ with $A^T = A$, hence $A$ is itself
symmetric and $B = A$. Call such a game doubly symmetric.

Definition 15.13 A strategy $x \in \Delta^m$ is locally strictly efficient if it has an open
neighborhood $U$ such that $xAx > yAy$ for all $y \in U \setminus \{x\}$. □

For doubly symmetric games, local strict efficiency characterizes ESS.

Proposition 15.14 Let $A = A^T$. Then $x \in ESS(A)$ if and only if $x$ is locally strictly
efficient.

Proof Let $x \in \Delta^m$. For any $y \neq x$ and $z = \frac{1}{2}x + \frac{1}{2}y$, we have

\[ yAy = xAx - 2xAz - 2zAx + 4zAz . \]

Hence, using the symmetry of $A$,

\[ xAx - yAy = 4\,[xAz - zAz] . \]

If $x$ is locally strictly efficient, then this identity implies that $x$ is locally superior,
and conversely. By Proposition 15.12, it follows that $x$ is an ESS if and only if $x$ is
locally strictly efficient. ■
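
The identity $xAx - yAy = 4[xAz - zAz]$ used in this proof can be checked with exact rational arithmetic. The following is a minimal sketch; the matrices and strategies are arbitrary illustrative choices, and the helper names are not from the text. The second example shows that the identity can fail when $A$ is not symmetric.

```python
from fractions import Fraction as F


def payoff(x, A, y):
    """Bilinear payoff xAy."""
    return sum(x[i] * A[i][j] * y[j] for i in range(len(x)) for j in range(len(y)))


def efficiency_identity(x, y, A):
    """Return (xAx - yAy, 4[xAz - zAz]) for the midpoint z = x/2 + y/2."""
    z = [(xi + yi) / 2 for xi, yi in zip(x, y)]
    lhs = payoff(x, A, x) - payoff(y, A, y)
    rhs = 4 * (payoff(x, A, z) - payoff(z, A, z))
    return lhs, rhs


# Symmetric matrix (assumed example): both sides coincide.
print(efficiency_identity([F(1, 3), F(2, 3)], [F(3, 4), F(1, 4)], [[2, 0], [0, 1]]))
# Non-symmetric matrix (assumed example): the two sides differ.
print(efficiency_identity([F(1), F(0)], [F(0), F(1)], [[0, 1], [0, 0]]))
```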


15.3  Replicator Dynamics and ESS

The  concept  of  an  evolutionary  stable  strategy  is  based  on  the  idea  of  mutation. 
Incorporation  of  the  evolutionary  concept  of  selection  calls  for  a  more  explicitly 
dynamic  approach. 


15.3.1  Replicator  Dynamics 

As before, consider a symmetric game described by the $m \times m$ matrix $A$. A mixed
strategy $x \in \Delta^m$ can be interpreted as a vector of population shares (a state) over the
pure strategies, evolving over time. To express time dependence, we write $x = x(t)$.
For each pure strategy $i$, the expected payoff of playing $i$ when the population
is in state $x$ is equal to $e^i Ax$, hence the average population payoff is equal to
$\sum_{i=1}^m x_i e^i Ax = xAx$. In the replicator dynamics it is assumed that population shares
develop according to the differential equation

\[ \dot x_i = dx_i(t)/dt = [e^i Ax - xAx]\, x_i \tag{15.3} \]

for each pure strategy $i = 1, 2, \ldots, m$, where dependence on $t$ is (partly) suppressed
from the notation. In other words, the share of the population playing strategy $i$
changes with rate proportional to the difference between the expected payoff of $i$
(individual fitness) and the average population payoff (average fitness).
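
To make (15.3) concrete, the right-hand side can be coded directly and integrated with a simple forward Euler scheme. This is a rough sketch under stated assumptions: the step size, the horizon, and the Hawk-Dove payoff matrix are illustrative choices, and forward Euler is used only for brevity, not because the text prescribes it.

```python
def replicator_rhs(x, A):
    """Right-hand side of (15.3): component i equals [e^i A x - xAx] * x_i."""
    m = len(x)
    fitness = [sum(A[i][j] * x[j] for j in range(m)) for i in range(m)]
    average = sum(x[i] * fitness[i] for i in range(m))
    return [(fitness[i] - average) * x[i] for i in range(m)]


def simulate(x0, A, dt=0.01, steps=5000):
    """Forward-Euler approximation of the trajectory starting at x0."""
    x = list(x0)
    for _ in range(steps):
        dx = replicator_rhs(x, A)
        x = [xi + dt * dxi for xi, dxi in zip(x, dx)]
    return x


# Hypothetical Hawk-Dove payoffs (an assumption for illustration only).
A = [[-1, 2], [0, 1]]
print(simulate([0.9, 0.1], A))  # approaches the interior rest point
print(simulate([1.0, 0.0], A))  # a pure state does not move
```

Because the components of the right-hand side sum to zero, the Euler iterates stay on the hyperplane $\sum_i x_i = 1$ up to rounding.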

To  study  the  replicator  dynamics  in  (15.3)  one  needs  to  apply  the  theory  of 
differential  equations  and  dynamical  systems.  For  a  first  analysis  we  can  restrict 
attention  to  a  few  basic  concepts  and  facts. 

For each initial state $x(0) = x^0 \in \Delta^m$, the system (15.3) induces a solution or
trajectory $\xi(t, x^0)$ in $\Delta^m$. Call state $x$ a stationary point of the dynamics (15.3) if
$\dot x = (\dot x_1, \ldots, \dot x_m) = (0, \ldots, 0)$. If $m = 2$ then $\dot x_1 = 0$ or $\dot x_2 = 0$ is sufficient for
$x$ to be a stationary point, since (15.3) implies the natural condition $\sum_{i=1}^m \dot x_i = 0$.
Note that any $e^i$ is a stationary point; this is a more or less artificial property of the
replicator dynamics. A state $x$ is Lyapunov stable if every open neighborhood $B$ of
$x$ contains an open neighborhood $B^0$ of $x$ such that $\xi(t, x^0) \in B$ for all $x^0 \in B^0$ and
$t \ge 0$. A state $x$ is asymptotically stable if it is Lyapunov stable and it has an open
neighborhood $B^*$ such that $\lim_{t \to \infty} \xi(t, x^0) = x$ for all $x^0 \in B^*$. It is easy to see that
Lyapunov stability implies stationarity.

Before studying the replicator dynamics in more detail, we state the following
useful fact without proof; see the Notes to this chapter for a reference. By $\Delta^m_0$ we
denote the (relative) interior of the set $\Delta^m$, i.e., $\Delta^m_0 = \{x \in \Delta^m \mid x > 0\}$ is the set
of completely mixed strategies or states.

Proposition 15.15 The replicator dynamics (15.3) has a unique solution $\xi(t, x^0)$,
$t \in \mathbb{R}$, through any initial state $x^0 \in \Delta^m$. The solution mapping $\xi : \mathbb{R} \times \Delta^m \to \Delta^m$
is continuous, and continuously differentiable with respect to time. Both $\Delta^m_0$ and
$\Delta^m \setminus \Delta^m_0$ are invariant, that is, $\xi(t, x^0) \in \Delta^m_0$ for all $t$ whenever $x^0 \in \Delta^m_0$ and
$\xi(t, x^0) \in \Delta^m \setminus \Delta^m_0$ for all $t$ whenever $x^0 \in \Delta^m \setminus \Delta^m_0$.

15.3.2  Symmetric 2 × 2 Games

In order to analyze the replicator dynamics for symmetric 2 × 2 games corresponding
to $A$, we can without loss of generality restrict attention again to the normalized
game

\[ A' = \begin{pmatrix} a_1 & 0 \\ 0 & a_2 \end{pmatrix} . \]

Now (15.3) reduces to

\[ \dot x_1 = [a_1 x_1 - a_2 x_2]\, x_1 x_2 \tag{15.4} \]

(and $\dot x_2 = -\dot x_1$). For case (1) in Sect. 15.1.1, $a_1 < 0$ and $a_2 > 0$, the stationary
points of the dynamics are $x = e^1$ and $x = e^2$. For all other $x$, $\dot x_1 < 0$, which implies
that the system then converges to $e^2$, the unique ESS. Hence, the (unique) ESS is
also the (unique) asymptotically stable state.
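
The sign argument for case (1) can be illustrated with the reduced equation (15.4) alone. The sketch below evaluates $\dot x_1$ on a grid and integrates it with forward Euler; the parameter values $a_1 = -1$, $a_2 = 1$, the step size, and the function names are assumptions chosen for illustration.

```python
def x1_dot(x1, a1, a2):
    """Right-hand side of (15.4), with x2 = 1 - x1 substituted."""
    x2 = 1.0 - x1
    return (a1 * x1 - a2 * x2) * x1 * x2


def integrate_x1(x1, a1, a2, dt=0.01, steps=2000):
    """Forward-Euler integration of (15.4) starting from the share x1."""
    for _ in range(steps):
        x1 += dt * x1_dot(x1, a1, a2)
    return x1


# Case (1) with assumed values a1 = -1 < 0 and a2 = 1 > 0.
interior_signs = [x1_dot(k / 100, -1.0, 1.0) < 0 for k in range(1, 100)]
print(all(interior_signs))            # the share of strategy 1 shrinks everywhere inside
print(integrate_x1(0.99, -1.0, 1.0))  # close to 0, i.e. the state approaches e^2
```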

From  the  answers  to  Problems  15.5  and  15.6  we  have: 

Proposition 15.16 Let $A$ be a generic 2 × 2 matrix and let $x \in \Delta^2$. Then $x \in ESS(A)$
if and only if $x$ is an asymptotically stable state of the replicator dynamics.

Note  that  this  proposition  implies  Proposition  8.5(b).  Part  (a)  of  Proposition  8.5 
follows  from  Problem  15.2. 


15.3.3  Dominated  Strategies 

Does the replicator dynamics eliminate dominated strategies? One answer to this
question is provided by the following proposition, which states that if we start from
a completely mixed strategy, eventually all strictly dominated pure strategies vanish,
i.e., their population shares converge to zero.

Proposition 15.17 Let $x^0 \in \Delta^m$ be completely mixed and let pure strategy $i$ be
strictly dominated. Then $\lim_{t \to \infty} \xi_i(t, x^0) = 0$.


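Proof Let $i$ be strictly dominated by $y \in \Delta^m$ and let

\[ \varepsilon = \min_{x \in \Delta^m} \, [yAx - e^i Ax] . \]

By continuity of the expected payoff function and compactness of $\Delta^m$, $\varepsilon > 0$. Define
$v_i : \Delta^m_0 \to \mathbb{R}$ by $v_i(x) = \ln x_i - \sum_{j=1}^m y_j \ln(x_j)$. The function $v_i$ is differentiable,
with time derivative at any point $x = \xi(t, x^0)$ equal to

\[
\begin{aligned}
\dot v_i(x) &= \left[ \frac{d v_i(\xi(t, x^0))}{dt} \right]_{\xi(t, x^0) = x}
= \sum_{j=1}^m \frac{\partial v_i(x)}{\partial x_j}\, \dot x_j
= \frac{\dot x_i}{x_i} - \sum_{j=1}^m y_j \frac{\dot x_j}{x_j} \\
&= (e^i - x)Ax - \sum_{j=1}^m y_j (e^j - x)Ax \\
&= (e^i - y)Ax \le -\varepsilon < 0 .
\end{aligned}
\]

(Cf. Proposition 15.15.) Hence, $v_i(\xi(t, x^0))$ decreases to minus infinity as $t \to \infty$.
Since $-\sum_{j=1}^m y_j \ln(\xi_j(t, x^0)) \ge 0$, we have $\ln(\xi_i(t, x^0)) \le v_i(\xi(t, x^0))$, so that
$\ln(\xi_i(t, x^0))$ also tends to minus infinity and $\xi_i(t, x^0) \to 0$. ■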

Proposition 15.17 remains true for pure strategies $i$ that are iteratively strictly
dominated. For weakly dominated strategies several things may happen, see
Problem 15.7.
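
Proposition 15.17 and the function $v_i$ from its proof can be illustrated by simulation. In the sketch below the payoff matrix is a made-up example in which the third pure strategy is strictly dominated by the first, so $y = e^1$; the forward Euler scheme, step size, and names are likewise assumptions.

```python
import math


def replicator_rhs(x, A):
    """Right-hand side of (15.3)."""
    m = len(x)
    fitness = [sum(A[i][j] * x[j] for j in range(m)) for i in range(m)]
    average = sum(x[i] * fitness[i] for i in range(m))
    return [(fitness[i] - average) * x[i] for i in range(m)]


def simulate(x0, A, dt=0.01, steps=3000):
    """Forward-Euler approximation of the trajectory starting at x0."""
    x = list(x0)
    for _ in range(steps):
        x = [xi + dt * dxi for xi, dxi in zip(x, replicator_rhs(x, A))]
    return x


def v(x, i, y):
    """v_i(x) = ln x_i - sum_j y_j ln x_j, defined for completely mixed x."""
    return math.log(x[i]) - sum(y[j] * math.log(x[j]) for j in range(len(x)))


# Assumed example: row 3 is strictly smaller than row 1 in every column.
A = [[3, 1, 2], [1, 3, 2], [0, 0, 1]]
start = [1 / 3, 1 / 3, 1 / 3]
end = simulate(start, A)
print(end[2])                                        # share of the dominated strategy
print(v(start, 2, [1, 0, 0]), v(end, 2, [1, 0, 0]))  # v_i has decreased
```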


15.3.4  Nash  Equilibrium  Strategies 


Consider  again  the  finite  symmetric  two-player  game  with  payoff  matrix  A.  What 
is  the  relation  between  the  replicator  dynamics  and  Nash  equilibrium  strategies? 
The answer is given by the following proposition, where $ST(A)$ denotes the set of
stationary states, hence (check!):

\[ ST(A) = \{x \in \Delta^m \mid \forall i \in C(x) \; [e^i Ax = xAx]\} . \tag{15.5} \]

Proposition 15.18 For any finite symmetric two-player game with payoff matrix $A$
we have:

(a) $\{e^1, \ldots, e^m\} \cup NE(A) \subseteq ST(A)$,

(b) $ST(A) \cap \Delta^m_0 = NE(A) \cap \Delta^m_0$,

(c) $ST(A) \cap \Delta^m_0$ is a convex set and if $z \in \Delta^m$ is a linear combination of states in
this set, then $z \in NE(A)$.

Proof It is straightforward from (15.3) that $e^i \in ST(A)$ for every pure strategy $i$. If
$x \in NE(A)$, then every $i \in C(x)$ is a pure best reply, hence $e^i Ax = xAx$. Hence,
$x \in ST(A)$. This proves (a). As to (b), part (a) implies that $ST(A) \cap \Delta^m_0 \supseteq NE(A) \cap
\Delta^m_0$. Further, $e^i Ax = xAx$ for every $x \in ST(A) \cap \Delta^m_0$ and every pure strategy $i$, so that
$x \in NE(A) \cap \Delta^m_0$. This proves (b).

It remains to prove the last claim. Let $x$ and $y$ be completely mixed stationary
points, and let $\alpha, \beta \in \mathbb{R}$ and $z = \alpha x + \beta y \in \Delta^m$. For any pure strategy $i$ we have

\[ e^i Az = \alpha e^i Ax + \beta e^i Ay = \alpha xAx + \beta yAy \]

since $x, y \in ST(A) \cap \Delta^m_0$. This implies that actually $e^i Az = zAz$ for all pure strategies
$i$, hence $z$ is stationary. If $z$ is completely mixed, then we are done by part (b).
Otherwise, $z$ is a boundary point of $ST(A) \cap \Delta^m_0$ and hence of $NE(A) \cap \Delta^m_0$, so $z \in
NE(A)$ since $NE(A)$ is a closed set. Finally, since $\Delta^m$ is convex and $z \in ST(A) \cap \Delta^m_0$
for all $\alpha, \beta \ge 0$ with $\alpha + \beta = 1$, $ST(A) \cap \Delta^m_0$ is a convex set. ■
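
Parts (a) and (b) can be illustrated by coding the membership tests for $ST(A)$, as characterized in (15.5), and for $NE(A)$. This is a minimal sketch with a numerical tolerance; the function names and the Hawk-Dove payoffs are assumptions made for the example.

```python
def expected_payoffs(x, A):
    """Return the vector (e^i A x)_i together with the average payoff xAx."""
    m = len(x)
    Ax = [sum(A[i][j] * x[j] for j in range(m)) for i in range(m)]
    return Ax, sum(x[i] * Ax[i] for i in range(m))


def is_stationary(x, A, tol=1e-9):
    """Condition (15.5): e^i A x = xAx for every i in the carrier C(x)."""
    Ax, average = expected_payoffs(x, A)
    return all(abs(Ax[i] - average) <= tol for i in range(len(x)) if x[i] > 0)


def is_symmetric_nash(x, A, tol=1e-9):
    """x is in NE(A) if no pure strategy earns more than xAx against x."""
    Ax, average = expected_payoffs(x, A)
    return all(Ax[i] <= average + tol for i in range(len(x)))


# Hypothetical Hawk-Dove payoffs (an assumption for illustration only).
A = [[-1, 2], [0, 1]]
for state in [(1.0, 0.0), (0.0, 1.0), (0.5, 0.5), (0.3, 0.7)]:
    print(state, is_stationary(state, A), is_symmetric_nash(state, A))
```

In this assumed game the pure states are stationary without being Nash equilibrium strategies, while the interior stationary state is one, in line with (a) and (b).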

Proposition 15.18 implies that every (symmetric) Nash equilibrium is stationary.
The weakest form of dynamical stability, Lyapunov stability, leads to a refinement
of Nash equilibrium, as the next result shows.

Proposition 15.19 Let $x \in \Delta^m$ be a Lyapunov stable stationary state. Then $x \in
NE(A)$.

Proof Suppose $x \notin NE(A)$. Then $e^i Ax - xAx > 0$ for some pure strategy $i$. By continuity,
there is a $\delta > 0$ and an open neighborhood $U$ of $x$ such that $e^i Ay - yAy \ge \delta$
for all $y \in U \cap \Delta^m$. Let $x^0 \in U \cap \Delta^m$ with $x^0_i > 0$. Then $\xi_i(t, x^0) \ge x^0_i \exp(\delta t)$
for all $t \ge 0$ (as long as the trajectory stays in $U$); this follows from the fact that the
system $\dot\eta = \delta\eta$ has solution $\eta(t) = \eta(0) \exp(\delta t)$. So $\xi_i(t, x^0)$ increases exponentially
from any $x^0 \in U \cap \Delta^m_0$ with $x^0_i > 0$. This contradicts Lyapunov stability of $x$. ■

The  final  result  in  this  subsection  says  that  if  a  trajectory  of  the  replicator  dynamics 
starts  from  an  interior  (completely  mixed)  state  and  converges,  then  the  limit  state 
is  a  Nash  equilibrium  strategy. 

Proposition 15.20 Let $x^0 \in \Delta^m_0$ and $x \in \Delta^m$ such that $x = \lim_{t \to \infty} \xi(t, x^0)$. Then
$x \in NE(A)$.

Proof Suppose that $x \notin NE(A)$. Then there is a pure strategy $i$ and an $\varepsilon > 0$ such that
$e^i Ax - xAx = \varepsilon$. Hence, there is a $T \in \mathbb{R}$ such that
$e^i A\xi(t, x^0) - \xi(t, x^0)A\xi(t, x^0) > \varepsilon/2$ for all $t \ge T$. By (15.3), $\dot x_i > x_i \varepsilon/2$ for all
$t \ge T$, and hence (as in the proof of Proposition 15.19)
$\xi_i(t, x^0) > \xi_i(T, x^0) \exp(\varepsilon(t - T)/2)$ for all $t \ge T$. Since
$\xi_i(T, x^0) > 0$, this implies $\xi_i(t, x^0) \to \infty$ as $t \to \infty$, a contradiction. ■


15.3.5  Perfect  Equilibrium  Strategies 

In  the  preceding  subsection  we  have  seen  that  Lyapunov  stability  implies  Nash 
equilibrium.  What  are  the  implications  of  asymptotic  stability? 

First,  asymptotic  stability  implies  Lyapunov  stability  and  therefore  also  Nash 
equilibrium.  Since  Nash  equilibrium  implies  stationarity,  however,  it  must  be  the 
case  that  an  asymptotically  stable  Nash  equilibrium  strategy  is  isolated ,  meaning 
that  it  has  an  open  neighborhood  in  which  there  are  no  other  Nash  equilibrium 
strategies.  If  not,  there  would  be  arbitrarily  close  stationary  states,  which  conflicts 
with  asymptotic  stability. 

Second,  asymptotic  stability  also  implies  perfection. 

Proposition 15.21 Let $x \in \Delta^m$ be asymptotically stable. Then $x \in NE(A)$ and $x$ is
isolated. Moreover, $(x, x)$ is a perfect equilibrium in $(A, A^T)$.

Proof We still have to prove that $(x, x)$ is a perfect equilibrium in $(A, A^T)$. Suppose
not. Then $x$ is weakly dominated by some $y \in \Delta^m \setminus \{x\}$, see Theorem 13.23. Hence
$yAz \ge xAz$ for all $z \in \Delta^m$. Define $v : \Delta^m \to \mathbb{R}$ by

\[ v(z) = \sum_{i \in C(z)} (y_i - x_i) \ln(z_i) \]

for all $z \in \Delta^m$. Similarly as in the proof of Proposition 15.17, we obtain that $v$ is
nondecreasing along all interior solution trajectories of (15.3), i.e., at any $z \in \Delta^m_0$
(so $z$ completely mixed),

\[ \dot v(z) = \sum_{i \in C(z)} (y_i - x_i) \frac{\dot z_i}{z_i} = \sum_{i=1}^m (y_i - x_i)[e^i Az - zAz] = (y - x)Az \ge 0 . \]

Since $x$ is asymptotically stable, it has an open neighborhood $U$ such that $\xi(t, x^0) \to
x$ for all $x^0 \in U \cap \Delta^m$. By nondecreasingness of $v$ along all interior solution
trajectories this implies $v(x) \ge v(z)$ for all $z \in U \cap \Delta^m_0$. We will construct, however,
a $z$ in $U \cap \Delta^m_0$ with $v(z) > v(x)$. This is a contradiction and, hence, $x$ must be perfect.
To construct such a $z$, define for $\delta \in (0, 1)$, $w \in \Delta^m_0$, and $\varepsilon \in [0, 1]$,

\[ z = (1 - \varepsilon)[(1 - \delta)x + \delta y] + \varepsilon w . \]

For $\varepsilon$ sufficiently small, we have $y_i > x_i \Rightarrow z_i > x_i$ and $y_i < x_i \Rightarrow z_i < x_i$. Moreover,
$z \in \Delta^m_0$ and

\[
\begin{aligned}
v(z) - v(x) &= \sum_{i=1}^m (y_i - x_i) \ln(z_i) - \sum_{i \in C(x)} (y_i - x_i) \ln(x_i) \\
&= \sum_{i \in C(x)} (y_i - x_i)[\ln(z_i) - \ln(x_i)] + \sum_{i \notin C(x)} y_i \ln(z_i) .
\end{aligned}
\]

Note that the first term after the second equality sign is positive. We will show
that the second term is zero, which completes the proof of the proposition, since
then $v(z) > v(x)$. To show that $\sum_{i \notin C(x)} y_i \ln(z_i) = 0$, it is sufficient to show that
$C(y) \subseteq C(x)$. Suppose that $j \in C(y)$ and $j \notin C(x)$. By asymptotic stability of $x$,
$\xi(t, x^0) \to x$ for all $x^0 \in U \cap \Delta^m_0$. Write

\[
v(\xi(t, x^0)) = \sum_{i :\, \xi_i(t, x^0) > 0,\; i \in C(x)} (y_i - x_i) \ln(\xi_i(t, x^0))
+ \sum_{i :\, \xi_i(t, x^0) > 0,\; i \notin C(x)} y_i \ln(\xi_i(t, x^0)) .
\]

The first term after the equality sign is bounded by some constant $\gamma$, since
$\xi_i(t, x^0) \to x_i > 0$. The second term converges to $-\infty$ since $\xi_j(t, x^0) \to x_j = 0$ and
$y_j > 0$. But $v(\xi(t, x^0)) \to -\infty$ contradicts the nondecreasingness of $v$ along the
trajectory $\xi(t, x^0)$. ■


15.4  Problems 


15.1.  Computing ESS in 2 × 2 Games (1)

Compute $ESS(A)$ for the following payoff matrices $A$.

(a) A = …  (Prisoners' Dilemma)

(b) A = …  (Coordination game)

(c) A = …  (Hawk-Dove game)


15.2.  Computing ESS in 2 × 2 Games (2)

Compute $ESS(A')$ for each of the cases (1), (2), and (3) in Sect. 15.1.1. Compare
with  your  answers  to  Problem  15.1. 


15.3.  Rock-Paper-Scissors  (1) 

Show that the Rock-Paper-Scissors game


has  no  ESS. 

15.4.  Uniform  Invasion  Barriers 

Find  the  maximal  value  of  the  uniform  invasion  barrier  for  the  ESS’s  in  each  of  the 
cases  (1),  (2),  and  (3)  in  Sect.  15.1.1. 

15.5.  Replicator  Dynamics  in  Normalized  Game  (1) 

Show that $A$ and $A'$ (see Sect. 15.3.2) result in the same replicator dynamics.

15.6.  Replicator  Dynamics  in  Normalized  Game  (2) 

(a) Simplify the dynamics (15.4) for case (1) in Sect. 15.1.1 by substituting $x_2 =
1 - x_1$ and plot $\dot x_1$ as a function of $x_1 \in [0, 1]$.

(b)  Carry  out  this  analysis  also  for  cases  (2)  and  (3).  What  is  your  conclusion? 



15.7.  Weakly  Dominated  Strategies  and  Replicator  Dynamics 

(a)  Consider  the  matrix 


Investigate the trajectories of the replicator dynamics.

(b)  Consider  the  matrix 


(‘ '  °) 
\0  0  0/ 


Investigate  the  trajectories  of  the  replicator  dynamics. 
[Cf.  Proposition  15.17.] 

15.8.  Stationary  Points  and  Nash  Equilibria  (1) 

Consider the two-person symmetric game with payoff matrix

\[ A = \begin{pmatrix} 0 & 2 & 0 \\ 2 & 0 & 0 \\ 1 & 1 & 0 \end{pmatrix} . \]


(a)  Compute  NE (A). 

(b)  Compute  ST (A). 

15.9.  Stationary  Points  and  Nash  Equilibria  (2) 

Consider the two-person symmetric game with payoff matrix

\[ A = \begin{pmatrix} 3 & 3 & 1 \\ 4 & 4 & 0 \\ 0 & 2 & 4 \end{pmatrix} . \]

(a)  Compute  NE  (A). 

(b)  Compute  ST  (A). 

(c)  Which  stationary  states  are  asymptotically  stable,  which  are  Lyapunov  stable, 
and  which  are  not  Lyapunov  stable? 

(d) Show directly from the definition of local superiority that the strategy
$x = (1/2, 0, 1/2)$ is not locally superior. [Hint: consider the strategy
$z = (1/2, \lambda, 1/2 - \lambda)$ for $0 < \lambda < 1/2$.]

15.10.  Lyapunov Stable States in 2 × 2 Games

Consider  the  normalized  two-player  symmetric  2x2  game  A'.  Compute  the 
Lyapunov  stable  states  for  cases  (1),  (2),  and  (3). 

15.11.  Nash  Equilibrium  and  Lyapunov  Stability 
Consider the symmetric game with payoff matrix

\[ A = \begin{pmatrix} 0 & 1 & 0 \\ 0 & 0 & 2 \\ 0 & 0 & 1 \end{pmatrix} . \]

Compute  NE(A).  Show  that  the  unique  element  in  this  set  is  not  Lyapunov  stable. 

15.12.  Rock-Paper-Scissors  (2) 

Consider the generalized Rock-Paper-Scissors game with payoff matrix

\[ A = \begin{pmatrix} 1 & 2 + a & 0 \\ 0 & 1 & 2 + a \\ 2 + a & 0 & 1 \end{pmatrix} \]

where $a \in \mathbb{R}$.

(a) Write down the three equations of the replicator dynamics.

(b) Define $h(x) = \ln(x_1 x_2 x_3)$ for $x$ positive and show that $\dot h(x) = 3 + a - 3xAx$.

(c) Show that the average payoff is equal to

\[ xAx = 1 + \frac{a}{2}\left(1 - \|x\|^2\right) \]

for each $x \in \Delta^3$, where $\|x\|$ is the Euclidean norm of $x$. Conclude that $\dot h(x) =
\frac{a}{2}(3\|x\|^2 - 1)$.

(d) Show that $\dot h(\frac{1}{3}, \frac{1}{3}, \frac{1}{3}) = 0$ and $\dot h(x)$ has the same sign as $a$ for other $x \in \Delta^3$.

(e)  Show  that  the  unique  Nash  equilibrium  in  this  game  is  (i)  asymptotically  stable 
for  a  >  0;  (ii)  Lyapunov  but  not  asymptotically  stable  for  a  =  0;  (iii)  not 
Lyapunov  stable  for  a  <  0. 


15.5  Notes

This  chapter  is  based  mainly  on  Weibull  (1995).  The  concept  of  an  evolutionary 
stable  strategy  was  introduced  by  Maynard  Smith  and  Price  (1973).  See  Selten 
(1980,  1983)  for  early  applications  of  evolutionary  stable  strategies  in  game  theory. 

For properness of $(x, x)$, where $x$ is an ESS (Proposition 15.8), see van Damme
(1991).

Uniform invasion barriers were introduced by Vickers and Cannings (1987), and
local superiority by Hofbauer et al. (1979). Proposition 15.14 is due to Hofbauer
and Sigmund (1988).

Replicator dynamics were introduced by Taylor and Jonker (1978).
Proposition 15.17 remains true for pure strategies $i$ that are iteratively strictly dominated:
see Samuelson and Zhang (1992). For Proposition 15.21, in particular the result
that asymptotic stability implies perfection, see Bomze (1986). For more on weakly
dominated strategies see Weibull (1995), p. 83 ff. See Sect. 3.3.2 in the same book
for more on the relation between Lyapunov stability and Nash equilibrium. For a
proof of Proposition 15.15, see Weibull (1995), Proposition 3.20.

A general reference to the theory of differential equations and dynamical systems
is Hirsch and Smale (1974).

Cooperative Games


16  TU-Games: Domination, Stable Sets, and the Core


In  a  game  with  transferable  utility  (TU-game)  each  coalition  (subset  of  players)  is 
characterized  by  its  worth,  i.e.,  a  real  number  representing  the  payoff  or  utility  that 
the  coalition  can  achieve  if  it  forms.  It  is  assumed  that  this  payoff  can  be  freely 
distributed  among  the  members  of  the  coalition  in  any  way  desired. 

For  some  examples  the  reader  is  referred  to  Chap.  1.  Chapter  9  presents  a  first 
acquaintance  with  transferable  utility  games.  Although  the  present  chapter  and  the 
following  ones  are  self-contained,  it  may  be  helpful  to  study  the  relevant  parts  of 
Chaps.  1  and  9  first. 

In  this  chapter  the  focus  is  on  the  core  of  a  transferable  utility  game.  Section  16.1 
starts  with  a  weaker  concept,  the  imputation  set,  and  introduces  the  concept  of 
domination.  Section  16.2  introduces  the  domination  core  and  the  core.  Section  16.3 
studies  these  solution  concepts  for  a  special  class  of  TU-games  called  simple  games. 
In  Sect.  16.4  we  briefly  review  von  Neumann  and  Morgenstern’s  stable  sets,  which 
are  also  based  on  the  concept  of  domination.  Section  16.5,  finally,  presents  a 
characterization  of  games  with  non-empty  cores  in  terms  of  balancedness. 


16.1  Imputations and Domination

We  start  with  repeating  the  definition  of  a  game  with  transferable  utility  (cf. 
Definition  9.1). 

Definition 16.1 A cooperative game with transferable utility or TU-game is a pair
$(N, v)$, where $N = \{1, \ldots, n\}$ with $n \in \mathbb{N}$ is the set of players, and $v$ is a function
assigning to each coalition $S$, i.e., to each subset $S \subseteq N$, a real number $v(S)$, such
that $v(\emptyset) = 0$. The function $v$ is called the characteristic function and $v(S)$ is called
the worth of $S$. The coalition $N$ is called the grand coalition. A payoff distribution
for coalition $S$ is a vector of real numbers $(x_i)_{i \in S}$. □

The set of coalitions is also denoted by $2^N$, so that a TU-game is a pair $(N, v)$ with
$v : 2^N \to \mathbb{R}$ such that $v(\emptyset) = 0$. The game $(N, v)$ is often denoted by $v$ if no
confusion about the set of players is likely to arise. Also, for a coalition $\{i, j, \ldots, k\}$
we sometimes write $i, j, \ldots, k$ or $ij \ldots k$ instead of $\{i, j, \ldots, k\}$. By $|S|$ we denote
the cardinality of a coalition $S$. By $\mathcal{G}^N$ the set of all TU-games with player set $N$ is
denoted.

We frequently use the notation $x(S) := \sum_{i \in S} x_i$ for a payoff distribution $x =
(x_1, \ldots, x_n) \in \mathbb{R}^N$ and a coalition $S \subseteq N$.

Let $(N, v)$ be a TU-game. A vector $x \in \mathbb{R}^N$ is called an imputation if

(a) $x$ is individually rational, i.e.,

\[ x_i \ge v(i) \text{ for all } i \in N , \]

(b) $x$ is efficient, i.e.,

\[ x(N) = v(N) . \]

The set of imputations of $(N, v)$ is denoted by $I(v)$. An element $x \in I(v)$ is a payoff
distribution of the worth $v(N)$ of the grand coalition $N$ which gives each player $i$ a
payoff $x_i$ which is at least as much as he can obtain when he operates alone.

Example 16.2 A game $v$ is called additive if $v(S \cup T) = v(S) + v(T)$ for all disjoint
coalitions $S$ and $T$. Such a game is completely determined by the worths of the
one-person coalitions $v(i)$ ($i \in N$), since $v(S) = \sum_{i \in S} v(i)$ for every coalition $S$. For an
additive game $v$, $I(v)$ consists of one point: $I(v) = \{(v(1), v(2), \ldots, v(n))\}$. □

Note that for a game $v$

\[ I(v) \neq \emptyset \text{ if and only if } v(N) \ge \sum_{i=1}^n v(i) . \]

For an essential game $v$, that is, a game with $v(N) \ge \sum_{i=1}^n v(i)$, $I(v)$ is the
convex hull of the points $f^1, f^2, \ldots, f^n$ where $f^i_k := v(k)$ if $k \neq i$ and $f^i_i :=
v(N) - \sum_{k \in N \setminus \{i\}} v(k)$. (See Problem 16.1.)


Example 16.3 Let $(N, v)$ be a three-person game with $v(1) = v(3) = 0$, $v(2) = 3$,
$v(1, 2, 3) = 5$. Then $I(v)$ is the triangle with vertices $f^1 = (2, 3, 0)$, $f^2 = (0, 5, 0)$
and $f^3 = (0, 3, 2)$. (See Fig. 16.1.) □
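
The vertices $f^1, \ldots, f^n$ of the imputation set depend only on the one-person worths and on $v(N)$, so they are easy to compute. The sketch below reproduces the numbers of Example 16.3; the function name and the list-based representation of the one-person worths are illustrative assumptions.

```python
def imputation_vertices(single_worths, grand_worth):
    """Vertices f^1, ..., f^n of I(v); returns [] if I(v) is empty.

    single_worths[k] holds v(k+1); f^i gives every player k != i the amount
    v(k) and player i the remainder v(N) - sum_{k != i} v(k).
    """
    surplus = grand_worth - sum(single_worths)
    if surplus < 0:
        return []  # v(N) < sum of v(i): no imputations exist
    vertices = []
    for i in range(len(single_worths)):
        f = list(single_worths)
        f[i] += surplus
        vertices.append(tuple(f))
    return vertices


# Example 16.3: v(1) = v(3) = 0, v(2) = 3, v(N) = 5.
print(imputation_vertices([0, 3, 0], 5))
```

For an additive game the surplus is zero and all vertices coincide, in line with Example 16.2.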


Fig. 16.1  Example 16.3



Definition 16.4 Let $(N, v)$ be a game. Let $y, z \in I(v)$, $S \in 2^N \setminus \{\emptyset\}$. Then $y$
dominates $z$ via coalition $S$, denoted by $y \,\mathrm{dom}_S\, z$, if

(1) $y_i > z_i$ for all $i \in S$,

(2) $y(S) \le v(S)$.

For $y, z \in I(v)$, $y$ is said to dominate $z$ (notation: $y \,\mathrm{dom}\, z$) if there is an $S \in 2^N \setminus \{\emptyset\}$
such that $y \,\mathrm{dom}_S\, z$. □

Thus, imputation $y$ dominates imputation $z$ via coalition $S$ if $y$ is better than $z$ for all
members $i \in S$ (this is condition (1)) and the payoffs $(y_i)_{i \in S}$ are attainable for the
members of $S$ by cooperation (this is condition (2)). Against each $z$ in

\[ D(S) := \{z \in I(v) \mid \text{there exists } y \in I(v) \text{ with } y \,\mathrm{dom}_S\, z\} \]

the players of $S$ can protest successfully. The set $D(S)$ consists of the imputations
which are dominated via $S$. Note that always $D(N) = \emptyset$ (see Problem 16.3). We call
$x \in I(v)$ undominated if $x \in I(v) \setminus \bigcup_{S \in 2^N \setminus \{\emptyset\}} D(S)$.

Example 16.5 Let $(N, v)$ be the three-person game with $v(1, 2) = 2$, $v(N) = 1$ and
$v(S) = 0$ if $S \neq \{1, 2\}, N$. Then $D(S) = \emptyset$ if $S \neq \{1, 2\}$ and $D(\{1, 2\}) = \{x \in
I(v) \mid x_3 > 0\}$. The elements $x$ in $I(v)$ which are undominated are those that satisfy
$x_3 = 0$. □

16.2  The Core and the Domination-Core

The  concept  of  domination  defined  in  the  preceding  section  gives  rise  to  the 
following  definition. 

Definition 16.6 The domination core (D-core) of a game $(N, v)$ is the set

\[ DC(v) := I(v) \setminus \bigcup_{S \in 2^N \setminus \{\emptyset\}} D(S) , \]

i.e., the set of all undominated elements in $I(v)$. The core of a game $(N, v)$ is the set

\[ C(v) := \{x \in I(v) \mid x(S) \ge v(S) \text{ for all } S \in 2^N \setminus \{\emptyset\}\} . \]
□

If $x \in C(v)$, then no coalition $S \neq N$ has an incentive to split off if $x$ is the proposed
payoff distribution in $N$, because the total amount $x(S)$ allocated to $S$ is not smaller
than the amount $v(S)$ which the players in $S$ can obtain by forming the coalition $S$.
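
Since the core is described by finitely many linear inequalities, membership of a given payoff vector can be tested by enumeration. The sketch below is one way to do this for small games; the dictionary-based encoding of $v$ (coalitions not listed have worth 0), the tolerance, and the function names are assumptions. The second game uses the one-person worths of Example 16.3 and additionally assumes worth 0 for all two-person coalitions, which the example itself does not specify.

```python
from itertools import combinations


def nonempty_coalitions(players):
    """Yield every nonempty coalition as a frozenset."""
    for size in range(1, len(players) + 1):
        for S in combinations(players, size):
            yield frozenset(S)


def in_core(x, v, players, tol=1e-9):
    """Check x(N) = v(N) and x(S) >= v(S) for all nonempty coalitions S.

    x maps players to payoffs; v maps frozensets to worths (default 0).
    """
    N = frozenset(players)
    if abs(sum(x[i] for i in N) - v.get(N, 0)) > tol:
        return False  # not efficient
    return all(sum(x[i] for i in S) >= v.get(S, 0) - tol
               for S in nonempty_coalitions(players))


players = (1, 2, 3)
# Example 16.5: v(1,2) = 2, v(N) = 1, all other worths 0.
v_165 = {frozenset({1, 2}): 2, frozenset(players): 1}
print(in_core({1: 0.5, 2: 0.5, 3: 0.0}, v_165, players))
# One-person worths of Example 16.3, two-person worths assumed to be 0.
v_163 = {frozenset({2}): 3, frozenset(players): 5}
print(in_core({1: 1, 2: 3, 3: 1}, v_163, players))
```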

For  the  game  in  Example  16.5  the  D-core  is  nonempty  and  the  core  is  empty.  In 
general  the  following  holds. 

Theorem  16.7  The  core  is  a  subset  of  the  D-core  for  each  TU-game. 

Proof Let $(N, v)$ be a game and $x \in I(v)$, $x \notin DC(v)$. Then there is a $y \in I(v)$ and
a coalition $S \neq \emptyset$ such that $y \,\mathrm{dom}_S\, x$. Thus, $v(S) \ge y(S) > x(S)$, which implies that
$x \notin C(v)$. ■

Elements  of  C(v)  can  easily  be  obtained  because  the  core  is  defined  with  the  aid 
of  linear  inequalities.  The  core  is  a  polytope.  Also  the  D-core  is  a  convex  set:  see 
Problem  16.2. 

A natural question that arises is: for which games is the core equal to the D-core?
Consider the following condition on a game $(N, v)$:

\[ v(N) \ge v(S) + \sum_{i \in N \setminus S} v(i) \text{ for all } S \in 2^N \setminus \{\emptyset\}. \tag{16.1} \]

It  turns  out  that  this  condition  is  sufficient  for  the  equality  of  core  and  D-core. 

Theorem 16.8 Let $(N, v)$ be a game satisfying (16.1). Then $DC(v) = C(v)$.

Proof In view of Theorem 16.7 it is sufficient to show that $DC(v) \subseteq C(v)$.

Claim Let $x \in I(v)$ with $x(S) < v(S)$ for some $S$, then there is a $y \in I(v)$ such that
$y \,\mathrm{dom}_S\, x$.

To prove this claim, define $y$ as follows. If $i \in S$, then $y_i := x_i + |S|^{-1}(v(S) - x(S))$.
If $i \notin S$, then $y_i := v(i) + (v(N) - v(S) - \sum_{j \in N \setminus S} v(j))\,|N \setminus S|^{-1}$. Then $y \in I(v)$,
where $y_i \ge v(i)$ for $i \in N \setminus S$ follows from (16.1). Furthermore, $y \,\mathrm{dom}_S\, x$. This
proves the claim.

To prove $DC(v) \subseteq C(v)$, suppose $x \in DC(v)$. Then there is no $y \in I(v)$ with
$y \,\mathrm{dom}\, x$. In view of the Claim it follows that $x(S) \ge v(S)$ for all $S \in 2^N \setminus \{\emptyset\}$. Hence,
$x \in C(v)$. ■
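
The construction of $y$ in the Claim is explicit and can be carried out mechanically. The following sketch uses exact fractions; the three-person game (with $v(2) = 3$, $v(1,2) = 4$, $v(N) = 5$ and all other worths 0), the dictionary encoding of $v$, and the function name are hypothetical choices made for illustration. The game is chosen so that (16.1) holds.

```python
from fractions import Fraction as F


def dominating_imputation(x, S, v, players):
    """Build y as in the Claim, for an imputation x with x(S) < v(S).

    x maps players to payoffs; v maps frozensets to worths (default 0).
    """
    N, S = frozenset(players), frozenset(S)
    worth = lambda T: v.get(frozenset(T), 0)
    x_S = sum(x[i] for i in S)
    if not x_S < worth(S):
        raise ValueError("the construction requires x(S) < v(S)")
    outside = N - S  # nonempty, because x(N) = v(N) for an imputation
    slack = worth(N) - worth(S) - sum(worth({j}) for j in outside)
    y = {}
    for i in S:
        y[i] = x[i] + F(worth(S) - x_S) / len(S)   # members of S gain equally
    for i in outside:
        y[i] = worth({i}) + F(slack) / len(outside)
    return y


players = (1, 2, 3)
v = {frozenset({2}): 3, frozenset({1, 2}): 4, frozenset(players): 5}
x = {1: 0, 2: 3, 3: 2}            # an imputation with x({1,2}) = 3 < 4
print(dominating_imputation(x, {1, 2}, v, players))
```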


Remark 16.9 Condition (16.1) is satisfied if the game $v$ has a non-empty core. So
in that case, $C(v) = DC(v)$. See also Problem 16.11. □


Many games $v$ derived from practical situations have the following property:

\[ v(S \cup T) \ge v(S) + v(T) \text{ for all disjoint } S, T \subseteq N. \tag{16.2} \]

A game satisfying (16.2) is called super-additive. Observe that (16.2) implies (16.1),
so that Theorem 16.8 holds for super-additive games in particular.
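
For small player sets, conditions (16.1) and (16.2) can be verified by brute-force enumeration of coalitions. The sketch below is exponential in the number of players and is meant for illustration only; the dictionary encoding of $v$ (unlisted coalitions have worth 0) and the function names are assumptions.

```python
from itertools import combinations


def coalitions(players):
    """All nonempty coalitions as frozensets."""
    return [frozenset(S) for r in range(1, len(players) + 1)
            for S in combinations(players, r)]


def worth(v, S):
    """Worth of coalition S; coalitions missing from v have worth 0."""
    return v.get(frozenset(S), 0)


def is_superadditive(v, players):
    """Condition (16.2): v(S u T) >= v(S) + v(T) for all disjoint S, T."""
    cs = coalitions(players)
    return all(worth(v, S | T) >= worth(v, S) + worth(v, T)
               for S in cs for T in cs if not (S & T))


def satisfies_16_1(v, players):
    """Condition (16.1): v(N) >= v(S) + sum of v(i) over i outside S."""
    N = frozenset(players)
    return all(worth(v, N) >= worth(v, S) + sum(worth(v, {i}) for i in N - S)
               for S in coalitions(players))


players = (1, 2, 3)
majority = {S: 1 for S in coalitions(players) if len(S) >= 2}
example_16_5 = {frozenset({1, 2}): 2, frozenset(players): 1}
print(is_superadditive(majority, players), satisfies_16_1(majority, players))
print(is_superadditive(example_16_5, players), satisfies_16_1(example_16_5, players))
```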


16.3  Simple  Games 

In  this  section  we  study  the  core  and  D-core  of  simple  games.  Simple  games  arise 
in  particular  in  political  situations,  see  for  instance  the  United  Nations  Security 
Council  example  in  Chap.  1. 


Definition 16.10 A simple game $(N, v)$ is a game where every coalition has either
worth 0 or worth 1, and the grand coalition $N$ has worth 1. Coalitions with worth
1 are called winning, the other coalitions are called losing. A minimal winning
coalition is a winning coalition for which every proper subset is losing. A player $i$ is
called a dictator in a simple game $(N, v)$ if $v(S) = 1$ if and only if $i \in S$. A player $i$
is called a veto player in a simple game $(N, v)$ if $i$ belongs to all winning coalitions.
The set of veto players of $v$ is denoted by $\mathrm{veto}(v)$. Hence,

\[ \mathrm{veto}(v) = \bigcap \{S \in 2^N \mid v(S) = 1\}. \]
□


The  next  example  suggests  that  non-emptiness  of  the  core  has  something  to  do  with 
the  existence  of  veto  players. 

For each $i \in N$ let $e^i \in \mathbb{R}^n$ denote the vector with $i$-th coordinate equal to 1 and
all other coordinates equal to 0.


Example  16.11  (1)  Let  i  £  N.  For  the  dictator  game  <5/,  which  is  the  simple  game 
with  8i(S)  —  1  if  and  only  if  i  £  S  one  has  7(5/)  =  {e*},  veto (5/)  =  {/}  and 
C(8i)  =  DC(8i)  =  {e'}. 

(2)  For  the  three-person  majority  game  with  v(S)  =  1  if  |Sj  £  {2,3}  and  v(S)  —  0 
if  |Sj  £  {0, 1}  one  has: 

{1, 2}  n  {1, 3}  fl  {2,  3}  fl  {1, 2,  3}  =  0  =  veto(v) 


and 


C(v)  =  DC(v )  =  0  . 

(3)  Let  T  be  a  nonempty  coalition.  For  the  T -unanimity  game  uj ,  which  is  the 
simple  game  with  uT(S )  =  1  if  and  only  if  T  c  S,  veto {uj)  —  T  and 


C(ut )  =  DC(ut )  =  conv{e'  |  i  £  T}  . 


□

The following theorem shows that the core of a simple game is nonempty if and only
if the game has veto players. Furthermore, core elements divide the total amount
$v(N) = 1$ of the grand coalition among the veto players. The D-core is equal to the
core for simple games except in one case where there is exactly one $k \in N$ with
$v(k) = 1$ and $k$ is not a veto player. See also Example 16.13 below.

Theorem 16.12 Let $(N, v)$ be a simple game. Then:

(1) $C(v) = \mathrm{conv}\{e^i \in \mathbb{R}^n \mid i \in \mathrm{veto}(v)\}$.

(2) If $\mathrm{veto}(v) = \emptyset$ and $\{i \in N \mid v(i) = 1\} = \{k\}$, then $C(v) = \emptyset$ and $DC(v) =
\{e^k\}$. Otherwise, $DC(v) = C(v)$.

Proof

(a) Suppose $i \in \mathrm{veto}(v)$. Let $S \in 2^N \setminus \{\emptyset\}$. If $i \in S$ then $e^i(S) = 1 \geq v(S)$, otherwise
$e^i(S) = 0 = v(S)$. Obviously, $e^i(N) = 1 = v(N)$. So $e^i \in C(v)$. This proves the
inclusion $\supseteq$ in (1) because $C(v)$ is a convex set.

(b) To prove the inclusion $\subseteq$ in (1), let $x \in C(v)$. It is sufficient to prove: $i \notin
\mathrm{veto}(v) \Rightarrow x_i = 0$. Suppose, to the contrary, that $x_i > 0$ for some non-veto
player $i$. Take $S$ with $v(S) = 1$ and $i \notin S$ (such an $S$ exists, otherwise $i$ would be
a veto player). Then $x(S) = x(N) - x(N \setminus S) \leq 1 - x_i < 1 = v(S)$, contradicting the fact
that $x$ is a core element. This concludes the proof of (1).

(c) If $\mathrm{veto}(v) = \emptyset$ and $k$ is the only player in the set $\{i \in N \mid v(i) = 1\}$, then
$C(v) = \emptyset$ by part (1), whereas $I(v) = \{e^k\}$, hence $DC(v) = \{e^k\}$. If $\mathrm{veto}(v) =
\emptyset$ and $\{i \in N \mid v(i) = 1\} = \emptyset$ then (16.1) is satisfied, so that core and D-core
are equal by Theorem 16.8. If $\mathrm{veto}(v) = \emptyset$ and $|\{i \in N \mid v(i) = 1\}| \geq 2$ then
$I(v) = \emptyset$ so that $C(v) = DC(v) = \emptyset$.

(d) To complete the proof of (2), suppose $\mathrm{veto}(v) \neq \emptyset$. Then $C(v) \neq \emptyset$ by part (1).
Hence $C(v) = DC(v)$ by Remark 16.9. ■

Example 16.13 Let $N = \{1, 2, 3\}$, $v(1) = v(2, 3) = v(1, 2, 3) = 1$ and $v(S) = 0$
for the other coalitions. Then $\mathrm{veto}(v) = \emptyset$, $C(v) = \emptyset$, $DC(v) = \{e^1\}$. Note that
this simple game is not super-additive, and does not satisfy (16.1). □
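The claims of Example 16.13 can be checked numerically. The sketch below (helper names are illustrative assumptions) computes the veto set and lists the coalitions whose core constraint $x(S) \geq v(S)$ is violated by the only imputation $e^1$.

```python
from itertools import combinations

N = (1, 2, 3)
winning = {frozenset({1}), frozenset({2, 3}), frozenset({1, 2, 3})}

def v(S):
    """Worth 1 for the winning coalitions of Example 16.13, else 0."""
    return 1 if frozenset(S) in winning else 0

def nonempty_coalitions(players):
    for size in range(1, len(players) + 1):
        for members in combinations(players, size):
            yield frozenset(members)

def violated_coalitions(x, players, v):
    """Coalitions S with x(S) < v(S); x maps each player to a payoff."""
    return [set(S) for S in nonempty_coalitions(players)
            if sum(x[i] for i in S) < v(S)]

# veto(v) is the intersection of all winning coalitions.
veto = set(N)
for S in winning:
    veto &= S

e1 = {1: 1, 2: 0, 3: 0}
print(veto)                            # set()
print(violated_coalitions(e1, N, v))   # [{2, 3}]
```

Since $v(1) = 1 = v(N)$, the vector $e^1$ is the only imputation, so nothing can dominate it; it nevertheless fails the core constraint for coalition $\{2, 3\}$.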


16.4  Stable  Sets 

The  definition  of  a  stable  set  is  again  based  on  the  concept  of  domination.  By  way 
of  example,  let  v  be  the  three-person  game  with  all  worths  equal  to  1  except  for  the 
one-person  coalitions,  which  have  worth  equal  to  0.  Observe  that  the  three  vectors 
$(\tfrac12, \tfrac12, 0)$, $(\tfrac12, 0, \tfrac12)$, and $(0, \tfrac12, \tfrac12)$ are imputations that do not dominate each other.

Moreover,  each  imputation  other  than  one  of  these  three  is  dominated  by  one  of 
these  three  (see  Problem  16.5).  For  this  reason,  von  Neumann  and  Morgenstern 
called the set of these three imputations a ‘solution’ of the game.

Definition 16.14 Let $v$ be a game and let $A \subseteq I(v)$. The set $A$ is called a stable
set if

(1) if $x, y \in A$ then $x$ does not dominate $y$,

(2) if $x \in I(v) \setminus A$ then there is a $y \in A$ that dominates $x$. □

The  first  property  in  Definition  16.14  is  called  internal  stability  and  the  second  one 
external  stability. 
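Internal stability of the three imputations from the introductory example can be checked mechanically. The sketch below assumes the usual definition of domination from Sect. 16.1 (not repeated in this section): $x$ dominates $y$ via a nonempty coalition $S$ if $x_i > y_i$ for all $i \in S$ and $x(S) \leq v(S)$. Players 1, 2, 3 are stored at indices 0, 1, 2, and the function names are illustrative.

```python
from fractions import Fraction
from itertools import combinations

half = Fraction(1, 2)
N = (0, 1, 2)  # players 1, 2, 3 stored at indices 0, 1, 2

def v(S):
    """Three-person game: worth 1 for coalitions of size >= 2, else 0."""
    return 1 if len(S) >= 2 else 0

def dominates(x, y, players=N, worth=v):
    """True if x dominates y via some nonempty coalition S (assumed definition)."""
    for size in range(1, len(players) + 1):
        for S in combinations(players, size):
            better = all(x[i] > y[i] for i in S)
            feasible = sum(x[i] for i in S) <= worth(S)
            if better and feasible:
                return True
    return False

A = [(half, half, 0), (half, 0, half), (0, half, half)]

# Internal stability: no element of A dominates an element of A.
internally_stable = not any(dominates(x, y) for x in A for y in A)

# One imputation outside A, and the elements of A that dominate it.
z = (Fraction(1, 3), Fraction(1, 3), Fraction(1, 3))
dominators = [x for x in A if dominates(x, z)]

print(internally_stable)   # True
print(len(dominators))     # 3
```

This only tests external stability at a single point; the general argument is the subject of Problem 16.5.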

The  three-person  game  described  above  has  many  stable  sets:  see  Problem  16.5. 
But  even  if  a  game  has  only  one  stable  set  then  still  a  selection  would  have 
to  be  made,  for  practical  purposes;  stability,  however,  is  a  property  of  sets,  not 
of  single  payoff  distributions.  The  core  does  not  suffer  from  this  problem  and, 
moreover,  in  that  case  there  exist  some  plausible  choices  (like  the  nucleolus,  see 
Chap.  19).  Moreover,  games  with  non-empty  cores  have  been  exactly  characterized 
(see  Sect.  16.5),  whereas  the  problem  of  existence  of  stable  sets  is  only  partially 
solved. 

Some  partial  existence  results  are  given  now.  First,  essential  simple  games  always 
have  stable  sets: 

Theorem 16.15 Let $v$ be a simple game and let $S$ be a minimal winning coalition.
Let $A_S$ be the set of those imputations $x$ with $x_i = 0$ for every $i \notin S$. Then, if $A_S \neq \emptyset$,
it is a stable set.

Proof  Problem  16.8.  ■ 

A  game  ( N ,  v)  is  called  a  zero-one  game  if  all  one-person  coalitions  have  worth  0 
and  the  grand  coalition  N  has  worth  1 .  In  the  following  example  symmetric  three- 
person  zero-one  games  are  considered. 

Example 16.16 Let $(N, v)$ be a game with $N = \{1, 2, 3\}$ and $v(i) = 0$ for all $i \in N$,
$v(N) = 1$, and $v(S) = \alpha$ for every two-person coalition $S$, where $0 \leq \alpha \leq 1$. Then:

(a) Let $\alpha \geq \tfrac23$. Then

$$\{(x, x, 1 - 2x),\ (x, 1 - 2x, x),\ (1 - 2x, x, x) \mid \tfrac{\alpha}{2} \leq x \leq \tfrac12\} \qquad (16.3)$$

is a stable set.

(b) For $\alpha < \tfrac23$, the set in (16.3) is internally but not externally stable. The union of
this set with the core of the game is a stable set.

(c) For $\alpha \leq \tfrac12$ the core is a (the unique) stable set.

For  the  proofs  of  these  statements  see  Problem  16.9.  □ 

The next theorem gives the relation between the domination core and stable sets.

Theorem 16.17 Let $(N, v)$ be a game. Then:

(a) The D-core of $v$ is a subset of any stable set.

(b) If $A$ and $B$ are stable sets and $A \neq B$, then $A \not\subseteq B$.

(c)  Suppose  the  D-core  of  v  is  a  stable  set.  Then  it  is  the  unique  stable  set  of  the 
game. 

Proof  Problem  16.10.  ■ 


16.5 Balanced Games and the Core


In this section we derive the Bondareva-Shapley Theorem, which characterizes
games  with  non-empty  cores  in  terms  of  balancedness.  First,  the  concepts  of 
balanced  maps,  collections,  and  games  are  introduced. 

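Let $N = \{1, 2, \ldots, n\}$. A map $\lambda : 2^N \setminus \{\emptyset\} \to \mathbb{R}_+ := \{t \in \mathbb{R} \mid t \geq 0\}$ is called a
balanced map if

$$\sum_{S \in 2^N \setminus \{\emptyset\}} \lambda(S) e^S = e^N .$$

Here $e^S \in \mathbb{R}^n$ is the characteristic vector for coalition $S$ with

$$e^S_i = 1 \text{ if } i \in S \text{ and } e^S_i = 0 \text{ if } i \in N \setminus S .$$

A collection $B$ of nonempty coalitions is called a balanced collection if there is a
balanced map $\lambda$ such that

$$B = \{S \in 2^N \mid \lambda(S) > 0\} .$$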


Example 16.18

(1) Let the nonempty coalitions $N_1, N_2, \ldots, N_k$ form a partition of $N$, i.e., $N =
\bigcup_{r=1}^{k} N_r$ and $N_s \cap N_t = \emptyset$ if $s \neq t$. Then $\{N_1, \ldots, N_k\}$ is a balanced
collection, corresponding to the balanced map $\lambda$ with $\lambda(S) = 1$ if $S \in
\{N_1, N_2, \ldots, N_k\}$ and $\lambda(S) = 0$ otherwise.

(2) For $N = \{1, 2, 3\}$ the set $B = \{\{1, 2\}, \{1, 3\}, \{2, 3\}\}$ is balanced and corresponds
to the balanced map $\lambda$ with $\lambda(S) = 0$ if $|S| \in \{1, 3\}$ and $\lambda(S) = \tfrac12$ if $|S| = 2$. □

In  order  to  have  an  interpretation  of  a  balanced  map,  one  can  think  of  each  player 
having  one  unit  of  time  (or  energy,  labor,  . . . )  to  spend.  Each  player  can  distribute 
his  time  over  the  various  coalitions  of  which  he  is  a  member.  Such  a  distribution  is 
‘balanced’ if it corresponds to a balanced map $\lambda$, where $\lambda(S)$ is interpreted as the
length of time that the coalition $S$ exists (‘cooperates’); balancedness of $\lambda$ means
that each player spends exactly his one unit of time over the various coalitions.

Definition 16.19 A game $(N, v)$ is called a balanced game if for each balanced map
$\lambda : 2^N \setminus \{\emptyset\} \to \mathbb{R}_+$ we have

$$\sum_{S \in 2^N \setminus \{\emptyset\}} \lambda(S) v(S) \leq v(N) . \qquad (16.4)$$

□ 
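As an illustration, the balanced map of Example 16.18(2) already shows that the three-person majority game is not balanced. The sketch below is a hedged example: a map is stored as a dictionary holding only the coalitions with positive weight, and the function names are assumptions made here.

```python
from fractions import Fraction
from itertools import combinations

def coalitions(n):
    """All nonempty coalitions of N = {1, ..., n} as frozensets."""
    players = range(1, n + 1)
    return [frozenset(c) for k in range(1, n + 1)
            for c in combinations(players, k)]

def is_balanced_map(lam, n):
    """Check lam(S) >= 0 and that every player's weights sum to exactly 1."""
    if any(w < 0 for w in lam.values()):
        return False
    return all(sum(w for S, w in lam.items() if i in S) == 1
               for i in range(1, n + 1))

def weighted_worth(lam, v):
    """Left-hand side of (16.4): sum over S of lam(S) * v(S)."""
    return sum(w * v(S) for S, w in lam.items())

n = 3
majority = lambda S: 1 if len(S) >= 2 else 0
# Example 16.18(2): weight 1/2 on each two-person coalition, 0 elsewhere.
lam = {S: Fraction(1, 2) for S in coalitions(n) if len(S) == 2}

print(is_balanced_map(lam, n))         # True
print(weighted_worth(lam, majority))   # 3/2, which exceeds v(N) = 1
```

Because this one balanced map violates (16.4), the majority game is not balanced, in line with its empty core in Example 16.11(2).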


Extending  the  interpretation  of  a  balanced  map  in  terms  of  a  distribution  of  time  to 
a  game,  balancedness  of  a  game  could  be  interpreted  as  saying  that  it  is  at  least  as 
productive  to  have  the  grand  coalition  operate  during  one  unit  of  time  as  to  have  a 
balanced  distribution  of  time  over  various  smaller  coalitions — worths  of  coalitions 
being  interpreted  as  productivities.  Thus,  in  a  balanced  game,  it  seems  advantageous 
to  form  the  grand  coalition.  Indeed,  technically  the  importance  of  the  notion  of 
balancedness  follows  from  Theorem  16.22.  This  theorem  characterizes  games  with 
a  nonempty  core.  Its  proof  is  based  on  the  following  duality  theorem. 

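For $x, y \in \mathbb{R}^n$, $x \cdot y$ denotes the usual inner product: $x \cdot y = \sum_{i=1}^{n} x_i y_i$.

Theorem 16.20 Let $A$ be an $n \times p$-matrix, $b \in \mathbb{R}^p$ and $c \in \mathbb{R}^n$, and let $\{x \in \mathbb{R}^n \mid
xA \geq b\} \neq \emptyset$ and $\{y \in \mathbb{R}^p \mid Ay = c,\ y \geq 0\} \neq \emptyset$. Then

$$\min\{x \cdot c \mid xA \geq b\} = \max\{b \cdot y \mid Ay = c,\ y \geq 0\} .$$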

Proof  Problem  16.13.  ■ 


Remark 16.21 In Theorem 16.20 also the following holds: if one of the programs
is infeasible (i.e., one of the two sets in the theorem is empty), then neither program
has an optimal solution (i.e., neither the minimum nor the maximum is
attained). See Problem 16.14 for a proof. □


Theorem 16.22 Let $(N, v)$ be a TU-game. Then the following two assertions are
equivalent:

(1) $C(v) \neq \emptyset$,

(2) $(N, v)$ is a balanced game.

Proof First note that $C(v) \neq \emptyset$ if and only if

$$v(N) = \min\Big\{\sum_{i=1}^{n} x_i \;\Big|\; x \in \mathbb{R}^n,\ x(S) \geq v(S) \text{ for all } S \in 2^N \setminus \{\emptyset\}\Big\} . \qquad (16.5)$$

By the duality theorem, Theorem 16.20, equality (16.5) holds if and only if

$$v(N) = \max\Big\{\sum_{S} \lambda(S) v(S) \;\Big|\; \sum_{S} \lambda(S) e^S = e^N,\ \lambda \geq 0\Big\} . \qquad (16.6)$$

(Take for $A$ the matrix with the characteristic vectors $e^S$ as columns, let $c := e^N$ and
let $b$ be the vector of coalitional worths. Obviously, the non-emptiness conditions in
Theorem 16.20 are satisfied.) Now (16.6) holds if and only if (16.4) holds. Hence
(1) and (2) are equivalent. ■

An  alternative  proof  of  Theorem  16.22  can  be  based  directly  on  Farkas’  Lemma 
(Lemma  22.5):  see  Problem  16.15. 


16.6  Problems 

16.1.  Imputation  Set  of  an  Essential  Game 

Prove that for an essential game $v$, $I(v)$ is the convex hull of the points $f^1, f^2, \ldots, f^n$,
as claimed in Sect. 16.1.

16.2.  Convexity  of  the  Domination  Core 

Prove  that  for  each  game  the  domination  core  is  a  convex  set. 

16.3.  Dominated  Sets  of  Imputations 

(a) Prove that for each game $(N, v)$, $D(S) = \emptyset$ if $|S| \in \{1, n\}$.

(b)  Determine  for  each  S  the  set  D(S)  for  the  cost  savings  game  (three  communities 
game)  in  Chap.  1.  Answer  the  same  question  for  the  glove  game  in  Chap.  1. 

16.4.  The  Domination  Relation 

(a) Prove that dom and dom$_S$ are irreflexive relations and that dom$_S$ is transitive
and antisymmetric.

(b) Construct a game $(N, v)$ and imputations $x$ and $y$ such that $x$ dom $y$ and $y$ dom $x$.

(c) Construct a game $(N, v)$ and $x, y, z \in I(v)$ with $x$ dom $y$ and $y$ dom $z$ and not
$x$ dom $z$.

16.5.  Stable  Sets  in  a  Three-Person  Game 

Let $(\{1, 2, 3\}, v)$ be the game with all worths equal to 1 except for the one-person
and the empty coalitions, which have worth equal to 0.

(a) Prove that each element of the imputation set of this game is dominated by
another element.

(b) Prove that in this game each $x \in I(v) \setminus A$ is dominated by an element of $A :=
\{(\tfrac12, \tfrac12, 0), (\tfrac12, 0, \tfrac12), (0, \tfrac12, \tfrac12)\}$.

(c) If $c \in [0, \tfrac12)$ and $B := \{x \in I(v) \mid x_3 = c\}$, then each element of $I(v) \setminus B$ is
dominated by an element of $B$. Show this.

16.6. Singleton Stable Set

Prove that if a game $(N, v)$ has a one-element stable set then $v(N) = \sum_{i \in N} v(i)$.

1 See Sect. 11.1 for definitions.

16.7.  A  Glove  Game 

Consider the three-person simple game $v$ defined by

$$v(S) := \begin{cases} 1 & \text{if } S = \{1, 2\} \text{ or } \{2, 3\} \text{ or } \{1, 2, 3\} \\ 0 & \text{otherwise.} \end{cases}$$

(a) Show that any imputation $(x_1, x_2, x_3)$ that is not equal to $e^2$ is dominated by
another imputation.

(b) Compute the core and the domination core.

(c) Show that the domination core is not a stable set.

(d) Show that

$$B := \{(\lambda, 1 - 2\lambda, \lambda) \mid 0 \leq \lambda \leq \tfrac12\}$$

is a stable set.

16.8.  Proof  of  Theorem  16.15 
Prove  Theorem  16.15. 

16.9.  Example  16.16 

Prove  the  statements  in  Example  16.16. 

16.10.  Proof  of  Theorem  16.17 

Prove  Theorem  16.17.  Does  this  theorem  also  hold  for  the  core  instead  of  the 
D-core? 

16.11.  Core  and  D-Core 

Is  (16.1)  also  a  necessary  condition  for  equality  of  the  core  and  the  D-core?  (Cf. 
Theorem  16.8.) 

16.12.  Strategic  Equivalence 

Let $(N, w)$ be strategically equivalent to $(N, v)$, that is, there are $k \in \mathbb{R}$, $k > 0$ and
$a \in \mathbb{R}^N$ such that for each coalition $S$: $w(S) = k v(S) + a(S)$. Show that

(i) $C(w) = kC(v) + a$ $(:= \{x \in \mathbb{R}^N \mid x = ky + a \text{ for some } y \in C(v)\})$

(ii) $DC(w) = kDC(v) + a$.


[The  equalities  (i)  and  (ii)  express  that  the  core  and  the  D-core  are  covariant  w.r.t. 
strategic  equivalence.] 

16.13.  Proof  of  Theorem  16.20 

Prove  Theorem  16.20.  [Hint:  use  Theorem  22.6.] 

16.14.  Infeasible  Programs  in  Theorem  16.20 

Prove the claim made in Remark 16.21. [Hint: Suppose, say, that there is no $y \geq 0$
with $Ay = c$. Then, certainly, the max-program does not have an optimal solution.
Use Farkas’ Lemma (Lemma 22.5) to conclude that there exists a vector $z$ with
$zA \geq 0$ and $z \cdot c < 0$. Suppose the min-program is feasible, i.e., there is an $x$ with
$xA \geq b$. Then, show that the min-program does not have an optimal solution by
considering the vectors $x + tz$ for $t \in \mathbb{R}$, $t > 0$.]

16.15.  Proof  of  Theorem  16.22  Using  Lemma  22.5 

Prove Theorem 16.22 with the aid of Lemma 22.5. [Hint: List the nonempty
coalitions $S \subseteq N$ as $S_1, \ldots, S_p$ ($p = 2^n - 1$) with $S_p = N$. Define the $(n + n + p) \times p$
matrix $A$ as follows. Column $k < p$ is $(e^{S_k}, -e^{S_k}, -e^k)$ where: $e^{S_k} \in \mathbb{R}^n$, $e^{S_k}_i = 1$
if $i \in S_k$, $e^{S_k}_i = 0$ if $i \notin S_k$. Column $p$ is $(e^N, -e^N, 0)$. Then $C(N, v) \neq \emptyset$ iff there
exists $(z, z', w) \in \mathbb{R}^n \times \mathbb{R}^n \times \mathbb{R}^p$ with $(z, z', w) \geq 0$ and $(z, z', w)A = b$, where
$b = (v(S_k))_{k=1}^{p}$. This has the form as in (a) of Lemma 22.5.]

16.16.  Balanced  Maps  and  Collections 

(a) Show that for any balanced map $\lambda$ one has $\sum_{S} \lambda(S) \geq 1$, with equality if and
only if the corresponding balanced collection equals $\{N\}$.

(b) If $B$ is a balanced collection unequal to $\{N\}$, then

$$B^c := \{S \in 2^N \setminus \{\emptyset\} \mid N \setminus S \in B\}$$

is also a balanced collection. Give the corresponding balanced map.

(c) Let $S \in 2^N \setminus \{\emptyset, N\}$. Prove that $\{S, (N \setminus \{i\})_{i \in S}\}$ is a balanced collection.

(d) Prove that the balanced maps form a convex set $\Lambda^n$.

16.17.  Minimum  of  Balanced  Games 

Show  that  the  minimum  of  two  balanced  games  is  again  balanced. 

16.18.  Balanced  Simple  Games 

A  simple  game  has  a  non-empty  core  if  and  only  if  it  has  veto  players,  cf. 
Theorem  16.12(1).  Derive  this  result  from  Theorem  16.22. 


16.7 Notes

The  concepts  of  domination,  imputation,  and  stable  set  were  introduced  by  von 
Neumann  and  Morgenstern  (1944/1947).  The  core  was  introduced  by  Gillies  (1953). 

Lucas  (1969)  gives  an  example  of  a(n  essential)  game  that  does  not  have  a  stable 
set;  see  also  Owen  (1995),  p.  253.  Theorem  16.22  is  due  to  Bondareva  (1962)  and 
Shapley  (1967). 

Problem 16.6 is taken from Morris (1994).

The Shapley Value

In Chap. 16 set-valued solution concepts for games with transferable utilities were
studied: the imputation set, core, domination core, and stable sets. In this chapter,
a one-point (single-valued) solution concept is discussed: the Shapley value. It may
again be helpful to first study the relevant parts of Chaps. 1 and 9.

Section 17.1 introduces the Shapley value by several formulas and presents (a
variation on) Shapley’s axiomatic characterization using additivity. In Sect. 17.2 we
present three other characterizations of the Shapley value: a description in terms
of Harsanyi dividends; an axiomatic characterization of Young based on strong
monotonicity; and Owen’s formula for the Shapley value based on a multilinear
extension of games. Section 17.3 discusses Hart and Mas-Colell’s approach to the
Shapley value based on potential and reduced games.


17.1 Definition and Shapley's Characterization


Let $(N, v)$ be a TU-game and let $\sigma : N \to N$ be a permutation of the player set.
Imagine that the players enter a room one by one in the ordering $\sigma(1), \sigma(2), \ldots,
\sigma(n)$ and give each player the marginal contribution he creates in the game. To be
more specific, let the set of predecessors of $i$ in $\sigma$ be the coalition

$$P_\sigma(i) := \{r \in N \mid \sigma^{-1}(r) < \sigma^{-1}(i)\} .$$

For example, if $N = \{1, 2, 3, 4, 5\}$ and $\sigma(1) = 2$, $\sigma(2) = 5$, $\sigma(3) = 4$, $\sigma(4) = 1$,
and $\sigma(5) = 3$, player 2 enters first, next players 5, 4, 1, and 3. So $P_\sigma(1) = \{2, 5, 4\}$.
Define the marginal vector $m^\sigma$ by

$$m^\sigma_i = v(P_\sigma(i) \cup \{i\}) - v(P_\sigma(i)) . \qquad (17.1)$$

Thus, the marginal vector $m^\sigma$ gives each player his marginal contribution to the
coalition formed by his entrance, according to the ordering $\sigma$.

Definition 17.1 The Shapley value $\Phi(v)$ of a game $(N, v)$ is the average of the
marginal vectors of the game, i.e.

$$\Phi(v) := \frac{1}{n!} \sum_{\sigma \in \Pi(N)} m^\sigma . \qquad (17.2)$$

Here, $\Pi(N)$ denotes the set of permutations of $N$. □

Example  17.2 

(1) For a two-person game $(N, v)$ the Shapley value is

$$\Phi(v) = \Big( v(1) + \frac{v(N) - v(1) - v(2)}{2},\ v(2) + \frac{v(N) - v(1) - v(2)}{2} \Big) .$$

(2) Let $(N, v)$ be the three-person game with $v(1) = v(2) = v(3) = 0$, $v(1, 2) = 4$,
$v(1, 3) = 7$, $v(2, 3) = 15$, $v(1, 2, 3) = 20$. Then the marginal vectors are given
in Table 17.1. The Shapley value of this game is equal to $\tfrac16 (21, 45, 54)$, as one
easily obtains from this table.

(3) The Shapley value $\Phi(v)$ for an additive game is equal to $(v(1), v(2), \ldots,
v(n))$. □
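The computation in Example 17.2(2) can be reproduced by enumerating all orderings. The following is a minimal sketch of formula (17.2), assuming the game is stored as a dictionary from frozensets to worths; the names `marginal_vector` and `shapley_by_orderings` are illustrative.

```python
from fractions import Fraction
from itertools import permutations

# Worths of the three-person game of Example 17.2(2).
worth = {
    frozenset(): 0,
    frozenset({1}): 0, frozenset({2}): 0, frozenset({3}): 0,
    frozenset({1, 2}): 4, frozenset({1, 3}): 7, frozenset({2, 3}): 15,
    frozenset({1, 2, 3}): 20,
}
players = (1, 2, 3)

def marginal_vector(order, worth):
    """Marginal contributions when players enter in the given order, as in (17.1)."""
    m, entered = {}, frozenset()
    for i in order:
        m[i] = worth[entered | {i}] - worth[entered]
        entered = entered | {i}
    return m

def shapley_by_orderings(players, worth):
    """Average of the marginal vectors over all orderings, as in (17.2)."""
    orders = list(permutations(players))
    total = {i: 0 for i in players}
    for order in orders:
        for i, contribution in marginal_vector(order, worth).items():
            total[i] += contribution
    return {i: Fraction(total[i], len(orders)) for i in players}

phi = shapley_by_orderings(players, worth)
print(phi)   # Fractions 7/2, 15/2 and 9 for players 1, 2 and 3
```

The result equals $\tfrac16 (21, 45, 54)$. Enumeration visits $n!$ orderings, so this approach is only practical for small player sets.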

Based on (17.2), a probabilistic interpretation of the Shapley value is as follows.
Suppose we draw from an urn, containing the elements of $\Pi(N)$, a permutation $\sigma$
(probability $(n!)^{-1}$). Then let the players enter a room one by one in the order $\sigma$ and
give each player the marginal contribution created by him. Then the $i$-th coordinate
$\Phi_i(v)$ of $\Phi(v)$ is the expected payoff to player $i$ according to this random procedure.


Table 17.1 Example 17.2(2)

$(\sigma(1), \sigma(2), \sigma(3))$    $m^\sigma_1$    $m^\sigma_2$    $m^\sigma_3$
(1,2,3)                0     4    16
(1,3,2)                0    13     7
(2,1,3)                4     0    16
(2,3,1)                5     0    15
(3,1,2)                7    13     0
(3,2,1)                5    15     0
$\sum$                21    45    54

1 Of course, $m^\sigma$ depends on the game $v$.

Using (17.1) formula (17.2) can be rewritten as

$$\Phi_i(v) = \frac{1}{n!} \sum_{\sigma \in \Pi(N)} \big[ v(P_\sigma(i) \cup \{i\}) - v(P_\sigma(i)) \big] . \qquad (17.3)$$

The terms at the right hand side of the summation sign are of the form $v(S \cup \{i\}) -
v(S)$, where $S$ is a subset of $N$ not containing $i$. For how many orderings does
one have $P_\sigma(i) = S$? The answer is $|S|!\,(n - 1 - |S|)!$, where the first factor $|S|!$
corresponds to the number of orderings of $S$ and the second factor $(n - 1 - |S|)!$ to
the number of orderings of $N \setminus (S \cup \{i\})$. Hence, (17.3) can be rewritten to obtain

$$\Phi_i(v) = \sum_{S : i \notin S} \frac{|S|!\,(n - 1 - |S|)!}{n!} \big( v(S \cup \{i\}) - v(S) \big) . \qquad (17.4)$$

Note that

$$\frac{|S|!\,(n - 1 - |S|)!}{n!} = \frac{1}{n} \binom{n-1}{|S|}^{-1} .$$

This gives rise to a second probabilistic interpretation of the Shapley value.
Construct a subset $S$ to which $i$ does not belong, as follows. First, draw at random
a number from the urn containing the numbers (possible sizes) $0, 1, 2, \ldots, n - 1$,
where each number has probability $n^{-1}$ to be drawn. If size $s$ is chosen, draw a set
from the urn containing the subsets of $N \setminus \{i\}$ of size $s$, where each set has the same
probability $\binom{n-1}{s}^{-1}$ to be drawn. If $S$ is drawn with $|S| = s$, then pay player $i$ the
amount $v(S \cup \{i\}) - v(S)$. Then, in view of (17.4), the expected payoff for player $i$
in this random procedure is the Shapley value for player $i$ of the game $(N, v)$.
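Formula (17.4) sums over coalitions instead of orderings. The sketch below evaluates it for the game of Example 17.2(2) and recovers the same values as the table; the dictionary representation and the function name `shapley_by_coalitions` are assumptions made for illustration.

```python
from fractions import Fraction
from itertools import combinations
from math import factorial

# Worths of the three-person game of Example 17.2(2).
worth = {
    frozenset(): 0,
    frozenset({1}): 0, frozenset({2}): 0, frozenset({3}): 0,
    frozenset({1, 2}): 4, frozenset({1, 3}): 7, frozenset({2, 3}): 15,
    frozenset({1, 2, 3}): 20,
}
players = (1, 2, 3)

def shapley_by_coalitions(i, players, worth):
    """Shapley value of player i via the weighted sum (17.4)."""
    n = len(players)
    others = [j for j in players if j != i]
    value = Fraction(0)
    for size in range(n):   # |S| = 0, ..., n - 1
        # Weight |S|! (n - 1 - |S|)! / n! depends only on the size of S.
        weight = Fraction(factorial(size) * factorial(n - 1 - size), factorial(n))
        for members in combinations(others, size):
            S = frozenset(members)
            value += weight * (worth[S | {i}] - worth[S])
    return value

phi = [shapley_by_coalitions(i, players, worth) for i in players]
print(phi)   # Fractions 7/2, 15/2 and 9
```

For player 1 the four terms are $\tfrac13 \cdot 0 + \tfrac16 \cdot 4 + \tfrac16 \cdot 7 + \tfrac13 \cdot 5 = \tfrac{21}{6}$.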

We  next  give  an  axiomatic  characterization  of  the  Shapley  value.  That  is,  we 
formulate  a  number  of  properties  that  a  one-point  solution  should  (or  might)  have 
and  then  show  that  the  Shapley  value  is  the  only  solution  with  these  properties. 

Definition 17.3 A value on $\mathcal{G}^N$ is a map $\psi : \mathcal{G}^N \to \mathbb{R}^N$.$^2$ □

The following axioms for a value $\psi : \mathcal{G}^N \to \mathbb{R}^N$ are used in the announced
characterization of the Shapley value.

Efficiency (EFF): $\sum_{i=1}^{n} \psi_i(v) = v(N)$ for all $v \in \mathcal{G}^N$.

The efficiency (sometimes called Pareto optimality or Pareto efficiency) axiom
needs no further explanation.

Call a player $i$ in a game $(N, v)$ a null-player if $v(S \cup i) - v(S) = 0$ for every
coalition $S \in 2^N$. Such a player does not contribute anything to any coalition, in
particular also $v(i) = 0$. So it seems reasonable that such a player obtains zero
according to the value. This is what the following axiom requires.

Null-player Property (NP): $\psi_i(v) = 0$ for all $v \in \mathcal{G}^N$ and all null-players $i$ in $v$.

Call players $i$ and $j$ symmetric in the game $(N, v)$ if $v(S \cup i) = v(S \cup j)$ for
every coalition $S \subseteq N \setminus \{i, j\}$. Symmetric players have the same contribution to
any coalition, and therefore it seems reasonable that they should obtain the same
payoff according to the value. That is the content of the following axiom.

Symmetry (SYM): $\psi_i(v) = \psi_j(v)$ for all $v \in \mathcal{G}^N$ and all symmetric players $i$ and $j$
in $v$.

2 Occasionally, also the word solution will be used.

In order to formulate the last axiom in the announced characterization, we define
the sum $(N, v + w)$ of two games $(N, v)$ and $(N, w)$ by $(v + w)(S) = v(S) + w(S)$
for every $S \in 2^N$.

This last axiom can be interpreted as follows. Suppose the game $(N, v)$ is played
today and the game $(N, w)$ tomorrow. If the value $\psi$ is applied then player $i$ obtains
in total: $\psi_i(N, v) + \psi_i(N, w)$. One may also argue that, in total, the game $(N, v + w)$
is played and that, accordingly, player $i$ should obtain $\psi_i(N, v + w)$. The following
axiom expresses the possible point of view that these two evaluations should not
make a difference.

Additivity (ADD): $\psi(v + w) = \psi(v) + \psi(w)$ for all $v, w \in \mathcal{G}^N$.

The  announced  characterization  is  the  following  theorem. 

Theorem 17.4 Let $\psi : \mathcal{G}^N \to \mathbb{R}^N$ be a value. Then $\psi$ satisfies EFF, NP, SYM, and
ADD, if, and only if, $\psi$ is the Shapley value $\Phi$.


The proof of Theorem 17.4 uses, through the additivity axiom, the fact that $\mathcal{G}^N$ is
a linear space, with addition $v + w$ and scalar multiplication $(\alpha v)(S) = \alpha v(S)$ for
all $v, w \in \mathcal{G}^N$, $S \in 2^N$, and $\alpha \in \mathbb{R}$. An obvious basis for $\mathcal{G}^N$ is the set $\{1_T \in \mathcal{G}^N \mid
T \in 2^N \setminus \{\emptyset\}\}$, where $1_T$ is the game defined by $1_T(T) = 1$ and $1_T(S) = 0$ for all
$S \neq T$ (cf. Problem 17.1). This basis is not very well suited for the present purpose
because the Shapley value $\Phi(1_T)$ ($T \in 2^N \setminus \{\emptyset\}$) cannot easily be determined from
the axioms in Theorem 17.4: in particular, there is no null-player in the game $1_T$ (cf.
Problem 17.1).

Another basis is the collection of unanimity games $\{u_T \in \mathcal{G}^N \mid T \in 2^N \setminus \{\emptyset\}\}$,
see Example 16.11(3) for the definition, and Problem 17.2. This basis is used in the
following  proof. 


Proof of Theorem 17.4 That the Shapley value satisfies the four axioms in the
theorem is the subject of Problem 17.3.

Conversely, suppose $\psi$ satisfies the four axioms. It has to be proved that $\psi = \Phi$.
Take $v \in \mathcal{G}^N$. Then there are unique numbers $c_T$ ($T \neq \emptyset$) such that $v = \sum_{T \neq \emptyset} c_T u_T$.
(Cf. Problem 17.2.) By ADD of $\psi$ and $\Phi$ it follows that

$$\psi(v) = \sum_{T \neq \emptyset} \psi(c_T u_T) , \qquad \Phi(v) = \sum_{T \neq \emptyset} \Phi(c_T u_T) .$$

So it is sufficient to show that for all $T \neq \emptyset$ and $c \in \mathbb{R}$:

$$\psi(c u_T) = \Phi(c u_T) . \qquad (17.5)$$

Take $T \neq \emptyset$ and $c \in \mathbb{R}$. Note first that for all $i \in N \setminus T$:

$$c u_T(S \cup \{i\}) - c u_T(S) = 0 \text{ for all } S ,$$

implying that $i$ is a null-player in $c u_T$. So, by NP of $\psi$ and $\Phi$:

$$\psi_i(c u_T) = \Phi_i(c u_T) = 0 \text{ for all } i \in N \setminus T . \qquad (17.6)$$

Now suppose that $i, j \in T$, $i \neq j$. Then, for every coalition $S \subseteq N \setminus \{i, j\}$, $c u_T(S \cup i) =
c u_T(S \cup j) = 0$, which implies that $i$ and $j$ are symmetric in $c u_T$. Hence, by SYM of
$\psi$ and $\Phi$:

$$\Phi_i(c u_T) = \Phi_j(c u_T) \text{ for all } i, j \in T \qquad (17.7)$$

and

$$\psi_i(c u_T) = \psi_j(c u_T) \text{ for all } i, j \in T . \qquad (17.8)$$

Then EFF and (17.6)-(17.8) imply that

$$\psi_i(c u_T) = \Phi_i(c u_T) = |T|^{-1} c \text{ for all } i \in T . \qquad (17.9)$$

Now (17.6) and (17.9) imply (17.5). ■

The following two axioms (for a value $\psi$) are stronger versions of the null-player
property and symmetry, respectively. Call a player $i$ in a game $(N, v)$ a dummy
player if $v(S \cup i) - v(S) = v(i)$ for all $S \subseteq N \setminus \{i\}$.

Dummy player Property (DUM): $\psi_i(v) = v(i)$ for all $v \in \mathcal{G}^N$ and all dummy
players $i$ in $v$.

A dummy player only contributes his own worth to every coalition, and that is what
he should be paid according to the dummy property.

For a permutation $\sigma \in \Pi(N)$ define the game $v^\sigma$ by

$$v^\sigma(S) := v(\sigma^{-1}(S)) \text{ for all } S \in 2^N$$

and define $\sigma^* : \mathbb{R}^N \to \mathbb{R}^N$ by $(\sigma^*(x))_{\sigma(k)} := x_k$ for all $x \in \mathbb{R}^N$ and $k \in N$.

Anonymity (AN): $\psi(v^\sigma) = \sigma^*(\psi(v))$ for all $v \in \mathcal{G}^N$ and all $\sigma \in \Pi(N)$.

Anonymity implies that a value does not discriminate between the players solely on
the basis of their ‘names’ (i.e., numbers).

The dummy player property implies the null-player property, and anonymity
implies symmetry. The Shapley value has the dummy player property. See
Problem 17.4 for these claims.

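The Shapley value is also anonymous.

Lemma 17.5 The Shapley value $\Phi$ is anonymous.

Proof

(1) First we show that

$$\rho^*(m^\sigma(v)) = m^{\rho\sigma}(v^\rho) \text{ for all } v \in \mathcal{G}^N ,\ \rho, \sigma \in \Pi(N) .$$

This follows because for all $i \in N$:

$$\begin{aligned}
(m^{\rho\sigma}(v^\rho))_{\rho\sigma(i)} &= v^\rho(\{\rho\sigma(1), \ldots, \rho\sigma(i)\}) - v^\rho(\{\rho\sigma(1), \ldots, \rho\sigma(i - 1)\}) \\
&= v(\{\sigma(1), \ldots, \sigma(i)\}) - v(\{\sigma(1), \ldots, \sigma(i - 1)\}) \\
&= (m^\sigma(v))_{\sigma(i)} = (\rho^*(m^\sigma(v)))_{\rho\sigma(i)} .
\end{aligned}$$

(2) Take $v \in \mathcal{G}^N$ and $\rho \in \Pi(N)$. Then (1), the fact that $\sigma \mapsto \rho\sigma$ is a surjection on
$\Pi(N)$ and the linearity of $\rho^*$ imply:

$$\begin{aligned}
\Phi(v^\rho) &= \frac{1}{n!} \sum_{\sigma \in \Pi(N)} m^\sigma(v^\rho) = \frac{1}{n!} \sum_{\sigma} m^{\rho\sigma}(v^\rho) \\
&= \frac{1}{n!} \sum_{\sigma} \rho^*(m^\sigma(v)) = \rho^*\Big(\frac{1}{n!} \sum_{\sigma} m^\sigma(v)\Big) = \rho^*(\Phi(v)) .
\end{aligned}$$

This proves the anonymity of $\Phi$. ■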


We  have  seen  so  far  that  the  Shapley  value  has  many  appealing  properties.  In 
the  following  sections  even  more  properties  and  characterizations  of  the  Shapley 
value  are  considered.  However,  it  also  has  some  drawbacks.  In  a  balanced  game  the 
Shapley  value  does  not  necessarily  assign  a  core  element.  Also,  it  does  not  have  to 
be individually rational (cf. Problem 17.5).

Variations of the Shapley value, obtained by omitting the symmetry (or
anonymity) requirement in particular, are discussed in Chap. 18.


17.2 Other Characterizations

In  this  section  three  other  characterizations  of  the  Shapley  value  are  discussed:  the 
dividend  approach,  an  axiomatization  based  on  a  monotonicity  condition,  and  the 
so-called  multilinear  approach. 


17.2.1  Dividends 

With  each  game  we  can  associate  another  game  by  computing  the  dividends  of 
coalitions. 

Definition 17.6 Let $(N, v)$ be a game. For each coalition $T$ the dividend $\Delta_v(T)$ is
defined, recursively, as follows.

$$\Delta_v(\emptyset) := 0$$

$$\Delta_v(T) := v(T) - \sum_{S : S \subsetneq T} \Delta_v(S) \text{ if } |T| \geq 1 .$$


□ 


The  relation  between  dividends  and  the  Shapley  value  is  described  in  the  next 
theorem.  The  Shapley  value  of  a  player  in  a  game  turns  out  to  be  the  sum  of  all 
equally  distributed  dividends  of  coalitions  to  which  the  player  belongs. 


Theorem 17.7 Let $v = \sum_{T \in 2^N \setminus \{\emptyset\}} c_T u_T$ (as in Problem 17.2). Then:

(a) $\Delta_v(T) = c_T$ for all $T \neq \emptyset$.

(b) The Shapley value $\Phi_i(v)$ for player $i$ is equal to the sum of the equally
distributed dividends of the coalitions to which player $i$ belongs, i.e.,

$$\Phi_i(v) = \sum_{T : i \in T} \frac{\Delta_v(T)}{|T|} .$$

Proof In the proof of Theorem 17.4 it was shown that $\Phi(c_T u_T) = |T|^{-1} c_T e^T$ for
each $T$, so by ADD, $\Phi(v) = \sum_{T \neq \emptyset} c_T |T|^{-1} e^T$. Hence, $\Phi_i(v) = \sum_{T : i \in T} c_T |T|^{-1}$.

The only thing left to show is that

$$c_T = \Delta_v(T) \text{ for all } T \neq \emptyset . \qquad (17.10)$$

The proof of this is done by induction. If $|T| = 1$, say $T = \{i\}$, then $c_T =
v(i) = \Delta_v(T)$. Suppose (17.10) holds for all $S \subsetneq T$. Then $\Delta_v(T) = v(T) - \sum_{S \subsetneq T}
\Delta_v(S) = v(T) - \sum_{S \subsetneq T} c_S = c_T$ because $v(T) = \sum_{S \subseteq T} c_S$. ■
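The recursion of Definition 17.6 and the formula in Theorem 17.7(b) can be tried out on the game of Example 17.2(2). The following is a minimal sketch; the dictionary representation and the helper names `subsets`, `dividends`, and `shapley_from_dividends` are assumptions for illustration.

```python
from fractions import Fraction
from itertools import combinations

# Worths of the three-person game of Example 17.2(2).
worth = {
    frozenset({1}): 0, frozenset({2}): 0, frozenset({3}): 0,
    frozenset({1, 2}): 4, frozenset({1, 3}): 7, frozenset({2, 3}): 15,
    frozenset({1, 2, 3}): 20,
}
players = (1, 2, 3)

def subsets(T, proper=False):
    """Nonempty subsets of T by increasing size; proper=True excludes T itself."""
    T = sorted(T)
    top = len(T) - 1 if proper else len(T)
    for size in range(1, top + 1):
        for members in combinations(T, size):
            yield frozenset(members)

def dividends(players, worth):
    """Dividends of all nonempty coalitions via the recursion of Definition 17.6."""
    delta = {}
    for T in subsets(players):   # smaller coalitions are processed first
        delta[T] = worth[T] - sum(delta[S] for S in subsets(T, proper=True))
    return delta

def shapley_from_dividends(players, worth):
    """Each player receives an equal share of every dividend he belongs to."""
    delta = dividends(players, worth)
    return {i: sum(Fraction(d, len(T)) for T, d in delta.items() if i in T)
            for i in players}

delta = dividends(players, worth)
phi = shapley_from_dividends(players, worth)
print(delta[frozenset({1, 2, 3})])   # -6
print(phi)                           # Fractions 7/2, 15/2 and 9
```

Here the two-person dividends are 4, 7, and 15 and the grand coalition has dividend $20 - 26 = -6$, so player 1 receives $2 + \tfrac72 - 2 = \tfrac72$, matching Example 17.2(2).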

17.2.2 Strong Monotonicity

The Shapley value obviously has the property that if a player contributes at least as much to any coalition in a game $v$ as in a game $w$, then his payoff from the Shapley value in $v$ is at least as large as that in $w$. Formally, the Shapley value satisfies the following axiom for a value $\psi : \mathcal{G}^N \to \mathbb{R}^N$; a proof is immediate from (17.4).

Strong Monotonicity (SMON): $\psi_i(v) \ge \psi_i(w)$ for all $v, w \in \mathcal{G}^N$ that satisfy

$$v(S \cup \{i\}) - v(S) \ge w(S \cup \{i\}) - w(S) \text{ for all } S \in 2^N.$$

We now show that together with efficiency and symmetry this axiom characterizes the Shapley value. For $T \subseteq N$ denote by $e^T \in \mathbb{R}^N$ the vector with $e^T_i = 1$ if $i \in T$ and $e^T_i = 0$ if $i \notin T$.

Theorem 17.8 Let $\psi : \mathcal{G}^N \to \mathbb{R}^N$ be a value. Then $\psi$ satisfies EFF, SYM, and SMON if and only if $\psi$ is the Shapley value $\Phi$.

Proof Obviously, $\Phi$ satisfies the three axioms. Conversely, suppose $\psi$ satisfies the three axioms.

(1) Let $z$ be the game that is identically zero. In this game, all players are symmetric, so SYM and EFF together imply $\psi(z) = 0$.

(2) Let $i$ be a null-player in a game $v$. Then the condition in SMON applies to $z$ and $v$ with all inequalities being equalities. So SMON yields $\psi_i(v) \ge \psi_i(z)$ and $\psi_i(z) \ge \psi_i(v)$. Hence by (1), $\psi_i(v) = 0$.

(3) Let $c \in \mathbb{R}$ and $T \in 2^N \setminus \{\emptyset\}$. Then (2) implies $\psi_i(c u_T) = 0$ for every $i \in N \setminus T$. This, SYM, and EFF imply $\psi_i(c u_T) = c|T|^{-1}$ for every $i \in T$. Hence, $\psi(c u_T) = c|T|^{-1} e^T$.

(4) Each $v \in \mathcal{G}^N$ can be written in a unique way as a linear combination of $\{u_T \mid T \in 2^N \setminus \{\emptyset\}\}$ (see Problem 17.2). So $v$ is of the form $\sum c_T u_T$. The proof of $\psi(v) = \Phi(v)$ will be completed by induction on the number $\alpha(v)$ of terms in $\sum c_T u_T$ with $c_T \neq 0$.

From (1), $\psi(v) = \Phi(v) = 0$ if $\alpha(v) = 0$, and from (3), $\psi(v) = \Phi(v)$ if $\alpha(v) = 1$ because $\Phi(c u_T) = c|T|^{-1} e^T$. Suppose $\psi(w) = \Phi(w)$ for all $w \in \mathcal{G}^N$ with $\alpha(w) < k$, where $k \ge 2$. Let $v$ be a game with $\alpha(v) = k$. Then there are coalitions $T_1, T_2, \ldots, T_k$ and real numbers $c_1, c_2, \ldots, c_k$, unequal to zero, such that $v = \sum_{r=1}^k c_r u_{T_r}$. Let $D := \bigcap_{r=1}^k T_r$.

For $i \in N \setminus D$, define $w^i := \sum_{r:\, i \in T_r} c_r u_{T_r}$. Because $\alpha(w^i) < k$, the induction hypothesis implies: $\psi_i(w^i) = \Phi_i(w^i)$. Further, for every $S \in 2^N$:

$$
\begin{aligned}
v(S \cup i) - v(S) &= \sum_{r=1}^k c_r u_{T_r}(S \cup i) - \sum_{r=1}^k c_r u_{T_r}(S) \\
&= \sum_{r:\, i \in T_r} c_r u_{T_r}(S \cup i) - \sum_{r:\, i \in T_r} c_r u_{T_r}(S) \\
&= w^i(S \cup i) - w^i(S),
\end{aligned}
$$

so that, by SMON of $\psi$ and $\Phi$, it follows that $\psi_i(v) = \psi_i(w^i) = \Phi_i(w^i) = \Phi_i(v)$. So

$$\psi_i(v) = \Phi_i(v) \text{ for all } i \in N \setminus D. \qquad (17.11)$$

Equation (17.11) and EFF for $\psi$ and $\Phi$ yield

$$\sum_{i \in D} \psi_i(v) = \sum_{i \in D} \Phi_i(v). \qquad (17.12)$$

Let $i, j \in D$, then for every $S \subseteq N \setminus \{i, j\}$:

$$(0 =)\; v(S \cup i) = \sum_{r=1}^k c_r u_{T_r}(S \cup i) = \sum_{r=1}^k c_r u_{T_r}(S \cup j) = v(S \cup j),$$

so $i$ and $j$ are symmetric. Hence, by SYM of $\psi$ and $\Phi$:

$$\psi_i(v) = \psi_j(v), \quad \Phi_i(v) = \Phi_j(v). \qquad (17.13)$$

Now $\psi(v) = \Phi(v)$ follows from (17.11)-(17.13). ■

17.2.3  Multilinear  Extension 

The Shapley value of a game may also be described by means of the multilinear extension of a game. Let $(N, v)$ be a game. Consider the function $f : [0,1]^N \to \mathbb{R}$ on the hypercube $[0,1]^N$, defined by

$$f(x_1, x_2, \ldots, x_n) = \sum_{S \in 2^N} \Big( \prod_{i \in S} x_i \prod_{i \in N \setminus S} (1 - x_i) \Big) v(S). \qquad (17.14)$$

Observe that the set of extreme points of $[0,1]^N$, $\mathrm{ext}([0,1]^N)$, is equal to $\{e^S \mid S \in 2^N\}$. Clearly,

$$f(e^S) = v(S) \text{ for each } S \in 2^N. \qquad (17.15)$$

So $f$ can be seen as an extension of $\tilde{v} : \mathrm{ext}([0,1]^N) \to \mathbb{R}$ with $\tilde{v}(e^S) := v(S)$. In view of Problem 17.8, $f$ is called the multilinear extension of ($\tilde{v}$ or) $v$.

One can give a probabilistic interpretation of $f(x)$. Suppose that each of the players $i \in N$, independently, decides whether to cooperate (probability $x_i$) or not (probability $1 - x_i$). So with probability $\prod_{i \in S} x_i \prod_{i \in N \setminus S} (1 - x_i)$ the coalition $S$ forms, which has worth $v(S)$. Then $f(x)$ as given in (17.14) can be seen as the expectation of the worth of the formed coalition.
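The expectation in (17.14) is easy to evaluate by enumerating coalitions. The sketch below is illustrative only; the function name `multilinear_extension` and the dictionary encoding (coalitions as frozensets, missing coalitions having worth 0) are assumptions. The game is the three-person game of Example 17.10.

```python
from itertools import combinations


def multilinear_extension(players, v, x):
    """Evaluate (17.14): expected worth when player i cooperates with probability x[i]."""
    items = sorted(players)
    total = 0.0
    for r in range(len(items) + 1):
        for combo in combinations(items, r):
            S = frozenset(combo)
            prob = 1.0  # probability that exactly coalition S forms
            for i in items:
                prob *= x[i] if i in S else 1.0 - x[i]
            total += prob * v.get(S, 0)
    return total


# Game of Example 17.10; coalitions not listed have worth 0.
v = {frozenset({1, 3}): 1, frozenset({2, 3}): 2, frozenset({1, 2, 3}): 4}
players = {1, 2, 3}

# Every player cooperates with probability 1/2.
half = multilinear_extension(players, v, {1: 0.5, 2: 0.5, 3: 0.5})
# At the extreme point e^{2,3} the extension returns v({2,3}), as in (17.15).
corner = multilinear_extension(players, v, {1: 0.0, 2: 1.0, 3: 1.0})
print(half, corner)
```

At the centre of the cube every coalition forms with probability $1/8$, so the value is $(1 + 2 + 4)/8$; at an extreme point only one coalition has positive probability.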

Another interpretation is to see $x \in [0,1]^N$ as a fuzzy set, where $x_i$ is the intensity of availability of player $i$ and to see $f$ as a characteristic function, defined for fuzzy coalitions in $N$.

Denote by $D_k f(x)$ the derivative of $f$ w.r.t. the $k$-th coordinate in $x$. The following result provides another description of the Shapley value, namely as the integral along the main diagonal of $[0,1]^N$ of $D_k f$.

Theorem 17.9 $\Phi_k(v) = \int_0^1 (D_k f)(t, t, \ldots, t)\,dt$ for each $k \in N$.

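Proof

$$
\begin{aligned}
D_k f(x) &= \sum_{T:\, k \in T} \Big( \prod_{i \in T \setminus \{k\}} x_i \prod_{i \in N \setminus T} (1 - x_i) \Big) v(T) - \sum_{S:\, k \notin S} \Big( \prod_{i \in S} x_i \prod_{i \in N \setminus (S \cup \{k\})} (1 - x_i) \Big) v(S) \\
&= \sum_{S:\, k \notin S} \Big( \prod_{i \in S} x_i \prod_{i \in N \setminus (S \cup \{k\})} (1 - x_i) \Big) \big( v(S \cup \{k\}) - v(S) \big).
\end{aligned}
$$

Hence, $\int_0^1 (D_k f)(t, t, \ldots, t)\,dt = \sum_{S:\, k \notin S} \big( \int_0^1 t^{|S|} (1 - t)^{n - |S| - 1}\,dt \big) \big( v(S \cup \{k\}) - v(S) \big)$. Using the well-known (beta-)integral formula (cf. Problem 17.9)

$$\int_0^1 t^{|S|} (1 - t)^{n - |S| - 1}\,dt = \frac{|S|!\,(n - |S| - 1)!}{n!}$$

it follows that $\int_0^1 (D_k f)(t, t, \ldots, t)\,dt = \sum_{S:\, k \notin S} \frac{|S|!\,(n - |S| - 1)!}{n!} \big( v(S \cup \{k\}) - v(S) \big) = \Phi_k(v)$ by (17.4). ■

3 Cf. Sect. 22.6.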


Example 17.10 Let $(N, v)$ be the three-person game with $v(1) = v(2) = v(3) = v(1,2) = 0$, $v(1,3) = 1$, $v(2,3) = 2$, $v(N) = v(1,2,3) = 4$. Then $f(x_1, x_2, x_3) = x_1(1 - x_2)x_3 + 2(1 - x_1)x_2 x_3 + 4 x_1 x_2 x_3 = x_1 x_3 + 2 x_2 x_3 + x_1 x_2 x_3$ for all $x \in [0,1]^N$.

So $D_1 f(x) = x_3 + x_2 x_3$, $D_2 f(x) = 2x_3 + x_1 x_3$, $D_3 f(x) = x_1 + 2x_2 + x_1 x_2$.

Theorem 17.9 implies:

$$\Phi_1(v) = \int_0^1 D_1 f(t,t,t)\,dt = \int_0^1 (t + t^2)\,dt = \frac{5}{6}, \quad \Phi_2(v) = \int_0^1 (2t + t^2)\,dt = 1\tfrac{1}{3}, \quad \Phi_3(v) = \int_0^1 (3t + t^2)\,dt = 1\tfrac{5}{6}.$$

□
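The integrals in Example 17.10 can be reproduced exactly, since along the diagonal $D_k f(t, \ldots, t)$ is a polynomial in $t$. The Python sketch below expands that polynomial with the binomial theorem and integrates it term by term; the function names and the dictionary encoding of the game are assumptions made for illustration.

```python
from fractions import Fraction
from itertools import combinations
from math import comb


def diagonal_derivative_coeffs(players, v, k):
    """Coefficients c[p] of t**p in D_k f(t, ..., t), from the proof of Theorem 17.9."""
    others = sorted(set(players) - {k})
    m = len(others)
    coeffs = [Fraction(0)] * (m + 1)
    for r in range(m + 1):
        for combo in combinations(others, r):
            S = frozenset(combo)
            marginal = Fraction(v.get(S | {k}, 0)) - Fraction(v.get(S, 0))
            # Expand t**r * (1 - t)**(m - r) by the binomial theorem.
            for j in range(m - r + 1):
                coeffs[r + j] += marginal * comb(m - r, j) * (-1) ** j
    return coeffs


def shapley_by_integration(players, v):
    """Integrate each diagonal polynomial over [0, 1]: t**p contributes 1/(p + 1)."""
    return {k: sum(c / (p + 1)
                   for p, c in enumerate(diagonal_derivative_coeffs(players, v, k)))
            for k in players}


# Game of Example 17.10; coalitions not listed have worth 0.
v = {frozenset({1, 3}): 1, frozenset({2, 3}): 2, frozenset({1, 2, 3}): 4}

coeffs_1 = diagonal_derivative_coeffs({1, 2, 3}, v, 1)  # polynomial t + t**2
phi = shapley_by_integration({1, 2, 3}, v)
print(coeffs_1, phi)
```

The three payoffs add up to $v(N) = 4$, which is a quick check on efficiency.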


17.3 Potential and Reduced Game

This  section  starts  with  discussing  the  potential  approach  to  the  Shapley  value.  The 
potential  is,  in  a  sense,  dual  to  the  concept  of  dividends.  Next,  reduced  games  are 
considered,  which  leads  to  another  axiomatic  characterization  of  the  Shapley  value. 


17.3.1 The Potential Approach to the Shapley Value

Denote by $\mathcal{G}$ the family of all games $(N, v)$ with an arbitrary finite (player) set $N$ (not necessarily the set of the first $n$ natural numbers). It is convenient to include also the game $(\emptyset, v)$, with empty player set. Thus,

$$\mathcal{G} = \bigcup_{N \subseteq \mathbb{N},\ |N| < \infty} \mathcal{G}^N.$$

Definition 17.11 A potential is a function $P : \mathcal{G} \to \mathbb{R}$ satisfying

$$P(\emptyset, v) = 0 \qquad (17.16)$$

$$\sum_{i \in N} D_i P(N, v) = v(N) \text{ for all } (N, v) \in \mathcal{G}. \qquad (17.17)$$

Here $D_i P(N, v) := P(N, v) - P(N \setminus \{i\}, v)$, where $v$ in the last expression is the restriction to $N \setminus \{i\}$. □

If $P$ is a potential then (17.17) says that the gradient $\mathrm{grad}\, P(N, v) := (D_i P(N, v))_{i \in N}$ is an efficient payoff vector for the game $(N, v)$.

Note that

$$P(\{i\}, v) = v(\{i\})$$

$$P(\{i, j\}, v) = \tfrac{1}{2} \big( v(\{i, j\}) + v(\{i\}) + v(\{j\}) \big)$$

if $P$ is a potential. More generally, it follows from (17.17) that

$$P(N, v) = |N|^{-1} \Big( v(N) + \sum_{i \in N} P(N \setminus \{i\}, v) \Big). \qquad (17.18)$$

So the potential of $(N, v)$ is uniquely determined by the potential of subgames of $(N, v)$, which implies with (17.16) the first assertion in Theorem 17.12 below. The second assertion in this theorem connects the potential of a game to the dividends (see Sect. 17.2.1) of a game. Further, the gradient of $P$ in $(N, v)$ is equal to the Shapley value of $(N, v)$. The theorem also provides an algorithm to calculate the Shapley value, by calculating with (17.16) and (17.18) the potentials of the game and its subgames and then using Theorem 17.12(c).

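Theorem 17.12

(a) There is a unique potential $P : \mathcal{G} \to \mathbb{R}$.

(b) $P(N, v) = \sum_{T \in 2^N \setminus \{\emptyset\}} c_T |T|^{-1}$ for $v = \sum_{T \in 2^N \setminus \{\emptyset\}} c_T u_T$.

(c) $\mathrm{grad}\, P(N, v) = \Phi(N, v)$.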

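Proof Part (a) follows immediately from (17.16) and (17.18). For (b) and (c), let $Q : \mathcal{G} \to \mathbb{R}$ be defined by $Q(\emptyset, v) := 0$ and $Q(N, v) := \sum_{T \in 2^N \setminus \{\emptyset\}} c_T |T|^{-1}$ for all $(N, v)$, where $v = \sum_{T \in 2^N \setminus \{\emptyset\}} c_T u_T$, if $N \neq \emptyset$. Further, for each $(N, v)$ and $i \in N$

$$D_i Q(N, v) = \sum_{\emptyset \neq T \subseteq N} c_T |T|^{-1} - \sum_{\emptyset \neq T \subseteq N \setminus \{i\}} c'_T |T|^{-1} \qquad (17.19)$$

if $v = \sum_{\emptyset \neq T \subseteq N} c_T u_T$ and $v' = \sum_{\emptyset \neq T \subseteq N \setminus \{i\}} c'_T u_T$, where $v'$ is the restriction of $v$ to $2^{N \setminus \{i\}}$. Since, for each $S \subseteq N \setminus \{i\}$, we have

$$\sum_{\emptyset \neq T \subseteq N \setminus \{i\}} c_T u_T(S) = v(S) = v'(S) = \sum_{\emptyset \neq T \subseteq N \setminus \{i\}} c'_T u_T(S)$$

we obtain by recursion that $c_T = c'_T$ for all $\emptyset \neq T \subseteq N \setminus \{i\}$. But then (17.19) and Theorem 17.7 imply

$$D_i Q(N, v) = \sum_{T:\, i \in T} c_T |T|^{-1} = \Phi_i(N, v).$$

Further, by efficiency (EFF) of $\Phi$:

$$\sum_{i \in N} D_i Q(N, v) = \sum_{i \in N} \Phi_i(N, v) = v(N).$$

So $Q$ is a potential. From (a) it follows that $Q = P$ and then (c) holds. ■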


The  potential  of  a  game  can  also  be  expressed  directly  in  terms  of  the  coalitional 
worths,  as  in  the  following  theorem. 


Theorem 17.13 For each game $(N, v) \in \mathcal{G}$:

$$P(N, v) = \sum_{\emptyset \neq S \subseteq N} \frac{(|S| - 1)!\,(|N| - |S|)!}{|N|!}\, v(S).$$

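Proof Let for each $(N, v) \in \mathcal{G}$,

$$Q(N, v) := \sum_{\emptyset \neq S \subseteq N} \frac{(|S| - 1)!\,(|N| - |S|)!}{|N|!}\, v(S).$$

Then $Q(\emptyset, v) = 0$. It is sufficient for the proof of $Q(N, v) = P(N, v)$, to show that $D_i Q(N, v) = \Phi_i(N, v)$ for all $i \in N$. Now

$$
\begin{aligned}
D_i Q(N, v) &= Q(N, v) - Q(N \setminus \{i\}, v) \\
&= \sum_{\emptyset \neq T \subseteq N} \frac{(|T| - 1)!\,(|N| - |T|)!}{|N|!}\, v(T) - \sum_{\emptyset \neq S \subseteq N \setminus \{i\}} \frac{(|S| - 1)!\,(|N| - 1 - |S|)!}{(|N| - 1)!}\, v(S) \\
&= \sum_{S \subseteq N \setminus \{i\}} \frac{|S|!\,(|N| - |S| - 1)!}{|N|!}\, v(S \cup \{i\}) + \sum_{\emptyset \neq S \subseteq N \setminus \{i\}} \frac{(|S| - 1)!\,(|N| - |S|)!}{|N|!}\, v(S) \\
&\quad - \sum_{\emptyset \neq S \subseteq N \setminus \{i\}} \frac{(|S| - 1)!\,(|N| - 1 - |S|)!}{(|N| - 1)!}\, v(S) \\
&= \sum_{S \subseteq N \setminus \{i\}} \frac{|S|!\,(|N| - |S| - 1)!}{|N|!}\, \big( v(S \cup \{i\}) - v(S) \big) = \Phi_i(N, v),
\end{aligned}
$$

where the last equality follows from (17.4). ■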


A probabilistic interpretation of Theorem 17.13 is the following. The number $|N|^{-1} P(N, v)$ is the expectation of the normalized worth $|S|^{-1} v(S)$ of the formed coalition $S \subseteq N$ if the probability that $S$ forms is $|N|^{-1} \binom{|N|}{|S|}^{-1}$ (corresponding to drawing first a size $s \in \{1, 2, \ldots, |N|\}$ and then a set $S$ with $|S| = s$).
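The closed form of Theorem 17.13 can be evaluated directly from the coalitional worths. The following Python sketch is illustrative; the function name `potential_direct` and the dictionary encoding of the game are assumptions, and the worths are those of Table 17.2.

```python
from fractions import Fraction
from itertools import combinations
from math import factorial


def potential_direct(players, v):
    """Potential via Theorem 17.13: weighted sum of the worths of nonempty coalitions."""
    items = sorted(players)
    n = len(items)
    total = Fraction(0)
    for s in range(1, n + 1):
        # Weight (s - 1)! (n - s)! / n! depends only on the coalition size s.
        weight = Fraction(factorial(s - 1) * factorial(n - s), factorial(n))
        for combo in combinations(items, s):
            total += weight * v[frozenset(combo)]
    return total


# Worths from Table 17.2.
v = {frozenset(): 0,
     frozenset({1}): 1, frozenset({2}): 2, frozenset({3}): 3,
     frozenset({1, 2}): 5, frozenset({1, 3}): 6, frozenset({2, 3}): 9,
     frozenset({1, 2, 3}): 15}

p_grand = potential_direct({1, 2, 3}, v)
p_23 = potential_direct({2, 3}, v)
print(p_grand, p_23)
```

For the grand coalition the three size classes contribute $\tfrac13 \cdot 6$, $\tfrac16 \cdot 20$, and $\tfrac13 \cdot 15$, which sum to $10\tfrac13$.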

Theorem  17.12  can  be  used  to  calculate  the  Shapley  value,  as  the  following 
example  shows. 

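Example 17.14 Consider the three-person game $(N, v)$ given in Table 17.2. The dividends of the subcoalitions and the potential of the subgames are given in the rows labelled $\Delta_v(S)$ and $P(S, v)$ of this table, respectively. It follows that

$$\Phi(N, v) = (D_1 P(N, v), D_2 P(N, v), D_3 P(N, v)) = (10\tfrac13 - 7,\ 10\tfrac13 - 5,\ 10\tfrac13 - 4) = (3\tfrac13, 5\tfrac13, 6\tfrac13)$$

and, for the subgame on $\{2, 3\}$,

$$\Phi(\{2, 3\}, v) = (7 - 3,\ 7 - 2) = (4, 5).$$

□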
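The algorithm behind Example 17.14 (compute potentials of all subgames with (17.16) and (17.18), then take differences as in Theorem 17.12(c)) can be sketched in a few lines of Python. The names `make_potential` and `shapley_from_potential` are assumptions introduced for this illustration; the worths are those of Table 17.2.

```python
from fractions import Fraction
from functools import lru_cache


def make_potential(v):
    """Return a memoized function P(coalition) computing the potential of each subgame."""
    @lru_cache(maxsize=None)
    def P(coalition):
        if not coalition:
            return Fraction(0)  # (17.16)
        smaller = sum(P(coalition - {i}) for i in coalition)
        return (Fraction(v[coalition]) + smaller) / len(coalition)  # (17.18)
    return P


def shapley_from_potential(players, v):
    """Theorem 17.12(c): the Shapley value is the gradient of the potential."""
    N = frozenset(players)
    P = make_potential(v)
    return {i: P(N) - P(N - {i}) for i in N}


# Worths from Table 17.2.
v = {frozenset(): 0,
     frozenset({1}): 1, frozenset({2}): 2, frozenset({3}): 3,
     frozenset({1, 2}): 5, frozenset({1, 3}): 6, frozenset({2, 3}): 9,
     frozenset({1, 2, 3}): 15}

P = make_potential(v)
phi = shapley_from_potential({1, 2, 3}, v)
print(P(frozenset({1, 2, 3})), phi)
```

Memoization matters here because each subgame potential is needed by several larger coalitions; without it the recursion would recompute the same subgames many times.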


17.3.2  Reduced  Games 

To introduce the concept of a reduced game, consider the game in Example 17.14. Suppose that the players agree on using the Shapley value, and consider the coalition $\{1, 2\}$. If players 1 and 2 pool their Shapley value payoffs then together they have $3\tfrac13 + 5\tfrac13 = 8\tfrac23$. Another way to obtain this amount is to take the worth of the grand coalition, 15, and to subtract player 3's payoff, $6\tfrac13$. Consider $\{1\}$ as a subcoalition of $\{1, 2\}$. Player 1 could form a coalition with player 3 and obtain the worth 6, but he would have to pay player 3 according to the Shapley value of the two-player game $(\{1, 3\}, v)$, which is the vector $(2, 4)$; recall that the players agree on using the Shapley value. So player 1 is left with $6 - 4 = 2$. Similarly, player 2 could form a coalition with player 3 and obtain $v(2, 3) = 9$ minus the Shapley value payoff for player 3 in the game $(\{2, 3\}, v)$, which is 5. So player 2 is left with $9 - 5 = 4$. So a 'reduced' game $(\{1, 2\}, \tilde{v})$ has been constructed with $\tilde{v}(1) = 2$, $\tilde{v}(2) = 4$, and $\tilde{v}(1, 2) = 8\tfrac23$. The Shapley value of this game is the vector $(3\tfrac13, 5\tfrac13)$. Observe that these payoffs are equal to the Shapley value payoffs in the original game. This is not a coincidence; the particular way of constructing a reduced game as illustrated here leaves the Shapley value invariant.

Table 17.2 Example 17.14

S                     ∅    {1}   {2}   {3}   {1,2}   {1,3}   {2,3}   {1,2,3}
$v(S)$                0    1     2     3     5       6       9       15
$\Delta_v(S)$         0    1     2     3     2       2       4       1
$\Delta_v(S)/|S|$     0    1     2     3     1       1       2       1/3
$P(S, v)$             0    1     2     3     4       5       7       10 1/3

$\Phi(N, v) = (3\tfrac13, 5\tfrac13, 6\tfrac13)$
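The construction of the reduced game on $\{1, 2\}$ described above can be replayed numerically. The Python sketch below is an illustration under stated assumptions: the function names `shapley` and `reduced_game` are hypothetical, the Shapley value is computed with the marginal-contribution formula (17.4), and the departing coalition is $U = \{3\}$ in the game of Table 17.2.

```python
from fractions import Fraction
from itertools import combinations
from math import factorial


def shapley(players, v):
    """Shapley value via the weighted marginal contributions of formula (17.4)."""
    items = sorted(players)
    n = len(items)
    phi = {}
    for i in items:
        others = [j for j in items if j != i]
        total = Fraction(0)
        for s in range(n):
            w = Fraction(factorial(s) * factorial(n - s - 1), factorial(n))
            for combo in combinations(others, s):
                S = frozenset(combo)
                total += w * (v[S | {i}] - v[S])
        phi[i] = total
    return phi


def reduced_game(players, v, U):
    """Reduced game on players minus U: worth of S union U minus what U is paid there."""
    U = frozenset(U)
    rest = sorted(frozenset(players) - U)
    reduced = {frozenset(): Fraction(0)}
    for s in range(1, len(rest) + 1):
        for combo in combinations(rest, s):
            S = frozenset(combo)
            phi = shapley(S | U, v)  # Shapley value of the subgame on S union U
            reduced[S] = v[S | U] - sum(phi[k] for k in U)
    return reduced


# Worths from Table 17.2.
v = {frozenset(): 0,
     frozenset({1}): 1, frozenset({2}): 2, frozenset({3}): 3,
     frozenset({1, 2}): 5, frozenset({1, 3}): 6, frozenset({2, 3}): 9,
     frozenset({1, 2, 3}): 15}

reduced = reduced_game({1, 2, 3}, v, {3})
phi_original = shapley({1, 2, 3}, v)
phi_reduced = shapley({1, 2}, reduced)
print(reduced, phi_reduced)
```

Players 1 and 2 receive the same payoffs in the reduced game as in the original game, which is the invariance observed in the text.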

A  general  game  theoretic  principle  is  the  following.  Suppose  in  a  game  a  subset 
of  the  players  consider  the  game  arising  among  themselves;  then,  if  they  apply  the 
same  ‘solution  rule’  as  in  the  original  game,  their  payoffs  should  not  change — they 
should  have  no  reason  to  renegotiate.  Of  course,  the  formulation  of  this  principle 
leaves  open  many  ways  to  define  ‘the  game  arising  among  themselves’.  Different 
definitions  correspond  to  different  solution  rules.  Put  differently,  there  are  many 
ways  to  define  reduced  games,  leading  to  many  different  ‘reduced  game  properties’ 
as  specific  instances  of  the  above  general  game  theoretic  principle.  Instead  of 
‘reduced  game  property’  also  the  term  ‘consistency’  is  used.  This  concept  has  been 
very  fruitful  over  the  past  decades — in  cooperative  as  well  as  noncooperative  game 
theory.  (For  a  reduced  game  approach  to  Nash  equilibrium  see  Sect.  13.8.) 

For  the  Shapley  value  the  following  reduced  game  turns  out  to  be  relevant.  It  is 
the  reduced  game  applied  in  the  example  above. 

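Definition 17.15 Let $\psi$ be a value, assigning to each $(N, v) \in \mathcal{G}$ a vector $\psi(N, v) \in \mathbb{R}^N$. For $(N, v) \in \mathcal{G}$ and $U \subseteq N$, $U \neq \emptyset$, the game $(N \setminus U, v_{U,\psi})$ is defined by $v_{U,\psi}(\emptyset) = 0$ and

$$v_{U,\psi}(S) := v(S \cup U) - \sum_{k \in U} \psi_k(S \cup U, v) \text{ for all } S \in 2^{N \setminus U} \setminus \{\emptyset\}.$$

$v_{U,\psi}$ is called the $(U, \psi)$-reduced game of $v$. □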

Thus, the worth of coalition $S$ in the game $v_{U,\psi}$ is obtained by subtracting from the worth of $S \cup U$ in the game $(S \cup U, v)$ the payoffs of the players in $U$ according to the value $\psi$. The reduced game property or consistency property for a value $\psi$ on $\mathcal{G}$ based on this reduced game is the following.

HM-consistency (HMC): for all games $(N, v)$ and all $U \subseteq N$, $U \neq \emptyset$,

$$\psi_i(N \setminus U, v_{U,\psi}) = \psi_i(N, v) \text{ for all } i \in N \setminus U. \qquad (17.20)$$

The following lemma is used in the proof of HM-consistency of the Shapley value.

4 Named after Hart and Mas-Colell.

Lemma 17.16 Let $(N, v) \in \mathcal{G}$. Suppose $Q : 2^N \to \mathbb{R}$ satisfies

$$\sum_{i \in S} \big( Q(S) - Q(S \setminus \{i\}) \big) = v(S) \text{ for each } S \in 2^N \setminus \{\emptyset\}.$$

Then for each $S \in 2^N$:

$$Q(S) = P(S, v) + Q(\emptyset). \qquad (17.21)$$

Proof The proof of (17.21) is by induction on $|S|$. Obviously, (17.21) holds if $|S| = 0$. Take $T$ with $|T| > 0$ and suppose (17.21) holds for all $S$ with $|S| < |T|$. Then

$$
\begin{aligned}
Q(T) &= |T|^{-1} \Big( v(T) + \sum_{i \in T} Q(T \setminus \{i\}) \Big) \\
&= |T|^{-1} \Big( v(T) + |T|\, Q(\emptyset) + \sum_{i \in T} P(T \setminus \{i\}, v) \Big) \\
&= Q(\emptyset) + |T|^{-1} \Big( v(T) + \sum_{i \in T} P(T \setminus \{i\}, v) \Big) \\
&= Q(\emptyset) + P(T, v),
\end{aligned}
$$

where the last equality follows from (17.18). ■

Lemma 17.17 The Shapley value $\Phi$ is HM-consistent.

Proof Take $(N, v)$ in $\mathcal{G}$, and $\emptyset \neq U \subseteq N$. We have to prove that

$$\Phi_i(N \setminus U, v_{U,\Phi}) = \Phi_i(N, v) \text{ for all } i \in N \setminus U. \qquad (17.22)$$

Note that, in view of the definition of $v_{U,\Phi}$, efficiency of $\Phi$, and Theorem 17.12(c), we have for all $S \subseteq N \setminus U$:

$$
\begin{aligned}
v_{U,\Phi}(S) &= v(S \cup U) - \sum_{i \in U} \Phi_i(S \cup U, v) = \sum_{i \in S} \Phi_i(S \cup U, v) \\
&= \sum_{i \in S} \big[ P(S \cup U, v) - P((S \cup U) \setminus \{i\}, v) \big]. \qquad (17.23)
\end{aligned}
$$

For $S \in 2^{N \setminus U}$ define $Q(S) := P(S \cup U, v)$. Then, by Lemma 17.16, with $N \setminus U$ and $v_{U,\Phi}$ in the roles of $N$ and $v$, (17.23) implies:

$$Q(S) = P(S, v_{U,\Phi}) + Q(\emptyset) = P(S, v_{U,\Phi}) + P(U, v)$$

for all $S \subseteq N \setminus U$. Hence, by the definition of $Q$:

$$P(S \cup U, v) = P(S, v_{U,\Phi}) + P(U, v). \qquad (17.24)$$

Equation (17.24) and Theorem 17.12(c) imply

$$
\begin{aligned}
\Phi_i(N \setminus U, v_{U,\Phi}) &= P(N \setminus U, v_{U,\Phi}) - P((N \setminus U) \setminus \{i\}, v_{U,\Phi}) \\
&= P(N, v) - P(N \setminus \{i\}, v) = \Phi_i(N, v).
\end{aligned}
$$

So (17.22) holds. ■

Call a value $\psi$ standard for two-person games if for all $(\{i, j\}, v) \in \mathcal{G}$

$$\psi_i(\{i, j\}, v) = \tfrac{1}{2} \big( v(i, j) + v(i) - v(j) \big),$$

$$\psi_j(\{i, j\}, v) = \tfrac{1}{2} \big( v(i, j) - v(i) + v(j) \big).$$

Note  that  the  Shapley  value  is  standard  for  2-person  games.  It  turns  out  that  this 
property,  together  with  HM-consistency,  characterizes  the  Shapley  value. 

Theorem 17.18 Let $\psi$ be a value on $\mathcal{G}$. Then $\psi$ is standard for two-person games and HM-consistent if and only if $\psi$ is the Shapley value.

Proof

(a) By Lemma 17.17, $\Phi$ is HM-consistent. Further, $\Phi$ is standard for two-person games.

(b) For the proof of the converse, assume $\psi$ has the two properties in the theorem.

It will first be proved that $\psi$ is efficient. For two-person games this is true by standardness. Let $(\{i\}, v)$ be a one-person game. To prove that $\psi_i(\{i\}, v) = v(i)$, construct a two-person game $(\{i, j\}, v^*)$ (with $j \neq i$) by defining

$$v^*(i) := v(i), \quad v^*(j) := 0, \quad v^*(i, j) := v(i).$$

From the standardness of $\psi$ it follows that

$$\psi_i(\{i, j\}, v^*) = v(i), \quad \psi_j(\{i, j\}, v^*) = 0. \qquad (17.25)$$

The definition of $v^*_{j,\psi}$ and (17.25) imply: $v^*_{j,\psi}(i) = v^*(i, j) - \psi_j(\{i, j\}, v^*) = v(i) - 0 = v(i)$. So

$$(\{i\}, v^*_{j,\psi}) = (\{i\}, v). \qquad (17.26)$$

Then, by (17.26), the consistency of $\psi$ and (17.25):

$$\psi_i(\{i\}, v) = \psi_i(\{i\}, v^*_{j,\psi}) = \psi_i(\{i, j\}, v^*) = v(i).$$

So for one- and two-person games the rule $\psi$ is efficient. Take a game $(S, v)$ with $|S| \ge 3$ and suppose that for all games $(T, v)$ with $|T| < |S|$ the rule $\psi$ is efficient. Take $k \in S$. Then the consistency of $\psi$ implies:

$$
\begin{aligned}
\sum_{i \in S} \psi_i(S, v) &= \psi_k(S, v) + \sum_{i \in S \setminus \{k\}} \psi_i(S, v) \\
&= \psi_k(S, v) + \sum_{i \in S \setminus \{k\}} \psi_i(S \setminus \{k\}, v_{k,\psi}) \\
&= \psi_k(S, v) + v_{k,\psi}(S \setminus \{k\}) \\
&= v(S).
\end{aligned}
$$

Hence, $\psi$ is an efficient rule.

(c) If a $Q : \mathcal{G} \to \mathbb{R}$ can be constructed with $Q(\emptyset, v) = 0$ and

$$\psi_i(N, v) = Q(N, v) - Q(N \setminus \{i\}, v) \qquad (17.27)$$

for all $(N, v)$ in $\mathcal{G}$ and $i \in N$, then by (b) and Theorem 17.12(a), $Q = P$ and then $\psi_i(N, v) = \Phi_i(N, v)$ by Theorem 17.12(c).

Hence, the proof is complete if we can construct such a $Q$. To achieve this, start with $Q(\emptyset, v) := 0$, $Q(\{i\}, v) := v(i)$, $Q(\{i, j\}, v) := \tfrac12 (v(i, j) + v(i) + v(j))$ and continue in a recursive way as follows. Let $(N, v) \in \mathcal{G}$, $|N| \ge 3$ and suppose $Q$ with property (17.27) has been defined already for games with less than $|N|$ players. Then one can define $Q(N, v) := \alpha$ if, and only if

$$\alpha - Q(N \setminus \{i\}, v) = \psi_i(N, v) \text{ for all } i \in N.$$

This implies that the proof is complete if it can be shown that

$$\psi_i(N, v) + Q(N \setminus \{i\}, v) = \psi_j(N, v) + Q(N \setminus \{j\}, v) \text{ for all } i, j \in N. \qquad (17.28)$$

To prove (17.28) take $k \in N \setminus \{i, j\}$ ($|N| \ge 3$). Then

$$
\begin{aligned}
\psi_i(N, v) - \psi_j(N, v) &= \psi_i(N \setminus \{k\}, v_{k,\psi}) - \psi_j(N \setminus \{k\}, v_{k,\psi}) \\
&= Q(N \setminus \{k\}, v_{k,\psi}) - Q(N \setminus \{k, i\}, v_{k,\psi}) \\
&\quad - Q(N \setminus \{k\}, v_{k,\psi}) + Q(N \setminus \{k, j\}, v_{k,\psi}) \\
&= \big( -Q(N \setminus \{k, i\}, v_{k,\psi}) + Q(N \setminus \{i, j, k\}, v_{k,\psi}) \big) \\
&\quad + Q(N \setminus \{k, j\}, v_{k,\psi}) - Q(N \setminus \{i, j, k\}, v_{k,\psi}) \\
&= -\psi_j(N \setminus \{k, i\}, v_{k,\psi}) + \psi_i(N \setminus \{k, j\}, v_{k,\psi}) \\
&= -\psi_j(N \setminus \{i\}, v) + \psi_i(N \setminus \{j\}, v) \\
&= -Q(N \setminus \{i\}, v) + Q(N \setminus \{i, j\}, v) \\
&\quad + Q(N \setminus \{j\}, v) - Q(N \setminus \{i, j\}, v) \\
&= Q(N \setminus \{j\}, v) - Q(N \setminus \{i\}, v),
\end{aligned}
$$

where HM-consistency of $\psi$ is used in the first and fifth equality and (17.27) for subgames in the second, fourth and sixth equality.

This proves (17.28). ■


17.4  Problems 

17.1. The Games $1_T$

(a) Show that $\{1_T \in \mathcal{G}^N \mid T \in 2^N \setminus \{\emptyset\}\}$ is a basis for $\mathcal{G}^N$.

(b) Show that there is no null-player in a game $(N, 1_T)$ ($T \neq \emptyset$).

(c) Determine the Shapley value $\Phi(N, 1_T)$.

17.2.  Unanimity  Games 

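(a) Prove that the collection of unanimity games $\{u_T \mid T \in 2^N \setminus \{\emptyset\}\}$ is a basis for $\mathcal{G}^N$. [Hint: In view of Problem 17.1(a) it is sufficient to show linear independence.]

(b) Prove that for each game $v \in \mathcal{G}^N$:

$$v = \sum_{T \in 2^N \setminus \{\emptyset\}} c_T u_T \quad \text{with} \quad c_T = \sum_{S:\, S \subseteq T} (-1)^{|T| - |S|}\, v(S).$$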

17.3. If-Part of Theorem 17.4

Show  that  the  Shapley  value  satisfies  EFF,  NP,  SYM,  and  ADD. 

17.4.  Dummy  Player  Property  and  Anonymity 

Show  that  DUM  implies  NP,  and  that  AN  implies  SYM,  but  that  the  converses  of 
these  implications  do  not  hold.  Show  that  the  Shapley  value  has  the  dummy  player 
property. 

17.5.  Shapley  Value,  Core,  and  Imputation  Set 

Show that the Shapley value of a game does not have to be an element of the core or of the imputation set, even if these sets are non-empty. How about the case of two players?

17.6. Shapley Value as a Projection

The Shapley value $\Phi : \mathcal{G}^N \to \mathbb{R}^N$ can be seen as a map from $\mathcal{G}^N$ to the space $A^N$ of additive games by identifying $\mathbb{R}^N$ with $A^N$. Prove that $\Phi : \mathcal{G}^N \to A^N$ is a projection, i.e., $\Phi \circ \Phi = \Phi$.

17.7.  Shapley  Value  of  Dual  Game 

The dual game $(N, v^*)$ of a game $(N, v)$ is defined by $v^*(S) = v(N) - v(N \setminus S)$ for every $S \subseteq N$. Prove that the Shapley value of $v^*$ is equal to the Shapley value of $v$. [Hint: If $v = \sum \alpha_T u_T$, then $v^* = \sum \alpha_T u_T^*$.]

17.8.  Multilinear  Extension 

(a) Show that $f$ in (17.14) is a multilinear function. [A function $g : \mathbb{R}^n \to \mathbb{R}$ is called multilinear if $g$ is of the form $g(x) = \sum_{T \subseteq N} c_T \prod_{i \in T} x_i$ for arbitrary real numbers $c_T$.]

(b) Prove that $f$ is the unique multilinear extension of $v$ to $[0,1]^N$.

17.9.  The  Beta-Integral  Formula 

Prove  the  beta-integral  formula  used  in  the  proof  of  Theorem  17.9. 

17.10. Path Independence of $\Phi$

Let $(N, v) \in \mathcal{G}^N$ with $N = \{1, 2, \ldots, n\}$. Prove that for each permutation $\tau : N \to N$ we have

$$\sum_{k=1}^n \Phi_{\tau(k)}(\{\tau(1), \tau(2), \ldots, \tau(k)\}, v) = \sum_{k=1}^n \Phi_k(\{1, 2, \ldots, k\}, v).$$


17.11.  An  Alternative  Characterization  of  the  Shapley  Value 

Let $\psi$ be a value on $\mathcal{G}$. Prove: $\psi = \Phi$ if and only if $\psi$ has the following four properties.

(a) $\psi$ is HM-consistent.

(b) $\psi$ is efficient for two-person games.

(c) $\psi$ is anonymous for two-person games.

(d) $\psi$ is relative invariant w.r.t. strategic equivalence for two-person games. [This means: $(\psi_i(\tilde{v}), \psi_j(\tilde{v})) = \alpha(\psi_i(v), \psi_j(v)) + (\beta_i, \beta_j)$ whenever $\tilde{v}(S) = \alpha v(S) + \sum_{i \in S} \beta_i$ for every coalition $S$, where $\alpha > 0$ and $\beta_i, \beta_j \in \mathbb{R}$.]


17.5 Notes

The Shapley value was introduced in Shapley (1953). This paper is the starting point of a large literature on this solution concept and related concepts. For overviews see Roth (1988) and Chaps. 53 and 54 in Aumann and Hart (2002).

The coalitional dividend approach is due to Harsanyi (1959). The axiomatization

In Chap. 17 we have seen that the Shapley value of a game does not have to be in the core of the game, nor even an imputation (Problem 17.5). In this chapter we introduce a set-valued extension of the Shapley value, the Weber set, and show
that  it  always  contains  the  core  (Sect.  18.1).  Next,  we  study  so-called  convex  games 
and  show  that  these  are  exactly  those  games  for  which  the  core  and  the  Weber  set 
coincide.  Hence,  for  such  games  the  Shapley  value  is  an  attractive  core  selection 
(Sect.  18.2).  Finally,  we  study  random  order  values  (Sect.  18.3),  which  fill  out  the 
Weber  set,  and  the  subset  of  weighted  Shapley  values,  which  still  cover  the  core 
(Sect.  18.4). 


18.1  The  Weber  Set 


Let $(N, v) \in \mathcal{G}^N$. Recall the definition of a marginal vector from Sect. 17.1.

Definition 18.1 The Weber set of a game $(N, v) \in \mathcal{G}^N$ is the convex hull of its marginal vectors:

$$W(v) := \mathrm{conv}\{m^\sigma(v) \mid \sigma \in \Pi(N)\}.$$

□


Example 18.2 Consider the three-person game $(\{1, 2, 3\}, v)$ defined by $v(12) = v(13) = 1$, $v(23) = -1$, $v(123) = 3$, and $v(i) = 0$ for every $i \in \{1, 2, 3\}$. The marginal vectors of this game, the core and the Weber set are given in Fig. 18.1. □
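The six marginal vectors of Example 18.2 follow mechanically from the definition: players enter in a given order and each receives his marginal contribution. The Python sketch below is illustrative; the function name `marginal_vector` and the dictionary encoding of the game are assumptions.

```python
from itertools import permutations


def marginal_vector(order, v):
    """Each player gets the worth he adds when entering in the given order."""
    payoff, entered = {}, frozenset()
    for i in order:
        payoff[i] = v[entered | {i}] - v[entered]
        entered = entered | {i}
    return payoff


# Game of Example 18.2.
v = {frozenset(): 0,
     frozenset({1}): 0, frozenset({2}): 0, frozenset({3}): 0,
     frozenset({1, 2}): 1, frozenset({1, 3}): 1, frozenset({2, 3}): -1,
     frozenset({1, 2, 3}): 3}

marginals = {order: marginal_vector(order, v) for order in permutations((1, 2, 3))}
for order, m in marginals.items():
    print(order, (m[1], m[2], m[3]))
```

Every marginal vector is efficient by construction, since the contributions along an order telescope to $v(N)$.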

We  show  now  that  the  core  is  always  a  subset  of  the  Weber  set. 

Theorem 18.3 Let $(N, v) \in \mathcal{G}^N$. Then $C(v) \subseteq W(v)$.

$(\sigma(1), \sigma(2), \sigma(3))$    $m^\sigma_1$    $m^\sigma_2$    $m^\sigma_3$
(1,2,3)                  0     1     2
(1,3,2)                  0     2     1
(2,1,3)                  1     0     2
(2,3,1)                  4     0    -1
(3,1,2)                  1     2     0
(3,2,1)                  4    -1     0

[Figure: marginal vectors, core, and Weber set of the game in Example 18.2]

Fig. 18.1 Example 18.2. The core is the convex hull of the vectors (3, 0, 0), (1, 2, 0), (0, 2, 1), (0, 1, 2), and (1, 0, 2). The Weber set is the convex hull of the six marginal vectors


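Proof Suppose there is an $x \in C(v) \setminus W(v)$. By a separation theorem (Theorem 22.1), there exists a vector $y \in \mathbb{R}^N$ such that $w \cdot y > x \cdot y$ for every $w \in W(v)$. In particular,

$$m^\sigma \cdot y > x \cdot y \text{ for every } \sigma \in \Pi(N). \qquad (18.1)$$

Let $\pi \in \Pi(N)$ with $y_{\pi(1)} \ge y_{\pi(2)} \ge \ldots \ge y_{\pi(n)}$. Since $x \in C(v)$,

$$
\begin{aligned}
m^\pi \cdot y &= \sum_{i=1}^n y_{\pi(i)} \big( v(\pi(1), \pi(2), \ldots, \pi(i)) - v(\pi(1), \pi(2), \ldots, \pi(i-1)) \big) \\
&= y_{\pi(n)} v(N) - y_{\pi(1)} v(\emptyset) + \sum_{i=1}^{n-1} (y_{\pi(i)} - y_{\pi(i+1)})\, v(\pi(1), \pi(2), \ldots, \pi(i)) \\
&\le y_{\pi(n)} \sum_{j=1}^n x_{\pi(j)} + \sum_{i=1}^{n-1} (y_{\pi(i)} - y_{\pi(i+1)}) \sum_{j=1}^i x_{\pi(j)} \\
&= \sum_{i=1}^n y_{\pi(i)} \sum_{j=1}^i x_{\pi(j)} - \sum_{i=2}^n y_{\pi(i)} \sum_{j=1}^{i-1} x_{\pi(j)} \\
&= \sum_{i=1}^n y_{\pi(i)} x_{\pi(i)} = x \cdot y,
\end{aligned}
$$

contradicting (18.1). ■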


18.2 Convex Games

For  the  coincidence  of  core  and  Weber  set  the  following  possible  property  of  a  game 
plays  a  crucial  role. 

Definition 18.4 A TU-game $(N, v)$ is convex if the following condition holds for all $S, T \subseteq N$:

$$v(S) + v(T) \le v(S \cup T) + v(S \cap T). \qquad (18.2)$$

□

Observe that convexity of a game implies super-additivity [cf. (16.2)]: $v$ is super-additive if (18.2) holds whenever $S$ and $T$ have empty intersection. The intuition is similar: larger coalitions have a relatively larger worth. This intuition is also apparent in the following condition:

$$\text{For all } i \in N \text{ and } S \subseteq T \subseteq N \setminus \{i\}: \quad v(S \cup i) - v(S) \le v(T \cup i) - v(T). \qquad (18.3)$$

Lemma 18.5 A game $(N, v) \in \mathcal{G}^N$ is convex if and only if it satisfies (18.3).

Proof Let $v \in \mathcal{G}^N$. Obviously, (18.3) follows from (18.2) by taking, for $S$ and $T$ in (18.2), $S \cup i$ and $T$ from (18.3), respectively.

In order to derive (18.2) from (18.3), first take $S_0 \subseteq T_0 \subseteq N$ and $R \subseteq N \setminus T_0$, say $R = \{i_1, \ldots, i_k\}$. By repeated application of (18.3) one obtains

$$
\begin{aligned}
v(S_0 \cup i_1) - v(S_0) &\le v(T_0 \cup i_1) - v(T_0) \\
v(S_0 \cup i_1 i_2) - v(S_0 \cup i_1) &\le v(T_0 \cup i_1 i_2) - v(T_0 \cup i_1) \\
&\ \,\vdots \\
v(S_0 \cup i_1 \cdots i_k) - v(S_0 \cup i_1 \cdots i_{k-1}) &\le v(T_0 \cup i_1 \cdots i_k) - v(T_0 \cup i_1 \cdots i_{k-1}).
\end{aligned}
$$

Adding these inequalities yields

$$v(S_0 \cup R) - v(S_0) \le v(T_0 \cup R) - v(T_0)$$

for all $R \subseteq N \setminus T_0$. Applying this inequality to arbitrary $S, T$ by setting $S_0 = S \cap T$, $T_0 = T$, and $R = S \setminus T$, yields (18.2). ■
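Lemma 18.5 says that conditions (18.2) and (18.3) are equivalent, which can be observed on small examples by brute force. The Python sketch below is illustrative only; the function names `is_convex` and `has_increasing_marginals` are assumptions, and the two test games are taken from Table 17.2 and Example 18.2.

```python
from itertools import combinations


def subsets(players):
    """Return all subsets of `players` as frozensets."""
    items = sorted(players)
    return [frozenset(c) for r in range(len(items) + 1)
            for c in combinations(items, r)]


def is_convex(players, v):
    """Condition (18.2), checked for every pair of coalitions S, T."""
    all_s = subsets(players)
    return all(v[S] + v[T] <= v[S | T] + v[S & T] for S in all_s for T in all_s)


def has_increasing_marginals(players, v):
    """Condition (18.3): a player's marginal contribution grows with the coalition."""
    N = frozenset(players)
    for i in N:
        for T in subsets(N - {i}):
            for S in subsets(T):
                if v[S | {i}] - v[S] > v[T | {i}] - v[T]:
                    return False
    return True


# Game of Table 17.2.
v_a = {frozenset(): 0,
       frozenset({1}): 1, frozenset({2}): 2, frozenset({3}): 3,
       frozenset({1, 2}): 5, frozenset({1, 3}): 6, frozenset({2, 3}): 9,
       frozenset({1, 2, 3}): 15}
# Game of Example 18.2.
v_b = {frozenset(): 0,
       frozenset({1}): 0, frozenset({2}): 0, frozenset({3}): 0,
       frozenset({1, 2}): 1, frozenset({1, 3}): 1, frozenset({2, 3}): -1,
       frozenset({1, 2, 3}): 3}

print(is_convex({1, 2, 3}, v_a), has_increasing_marginals({1, 2, 3}, v_a))
print(is_convex({1, 2, 3}, v_b), has_increasing_marginals({1, 2, 3}, v_b))
```

In the second game the pair $S = \{2\}$, $T = \{3\}$ already violates (18.2), because $v(2) + v(3) = 0$ exceeds $v(23) + v(\emptyset) = -1$.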

The  importance  of  convexity  of  a  game  for  the  relation  between  the  core  and  the 
Weber  set  follows  from  the  following  theorem. 

Theorem 18.6 Let $v \in \mathcal{G}^N$. Then $v$ is convex if and only if $C(v) = W(v)$.

Proof

(a) Suppose $v$ is convex. For the 'only if' part it is, in view of Theorem 18.3 and convexity of the core, sufficient to prove that each marginal vector $m^\pi(v)$ of $v$ is in the core. In order to show this, assume for notational simplicity that $\pi$ is identity. Let $S \subseteq N$ be an arbitrary coalition, say $S = \{i_1, \ldots, i_s\}$ with $i_1 < \ldots < i_s$. Then, for $1 \le k \le s$, by (18.3):

$$v(i_1, \ldots, i_k) - v(i_1, \ldots, i_{k-1}) \le v(1, 2, \ldots, i_k) - v(1, 2, \ldots, i_k - 1) = m^\pi_{i_k}(v).$$

Summing these inequalities from $k = 1$ to $k = s$ yields

$$v(S) = v(i_1, \ldots, i_s) \le \sum_{k=1}^s m^\pi_{i_k}(v) = \sum_{i \in S} m^\pi_i(v),$$

which shows $m^\pi(v) \in C(v)$.

(b) For the converse, suppose that all marginal vectors of $v$ are in the core. Let $S, T \subseteq N$ be arbitrary. Order the players of $N$ as follows:

$$N = \{\underbrace{i_1, \ldots, i_k}_{S \cap T},\ \underbrace{i_{k+1}, \ldots, i_\ell}_{T \setminus S},\ \underbrace{i_{\ell+1}, \ldots, i_s}_{S \setminus T},\ \underbrace{i_{s+1}, \ldots, i_n}_{N \setminus (S \cup T)}\}.$$

This defines an ordering or permutation $\pi$, namely by $\pi(j) = i_j$ for all $j \in N$, with corresponding marginal vector $m(v) = m^\pi(v)$. Since $m(v) \in C(v)$,

$$
\begin{aligned}
v(S) &\le \sum_{i \in S} m_i(v) \\
&= \sum_{j=1}^k m_{i_j}(v) + \sum_{j=\ell+1}^s m_{i_j}(v) \\
&= v(i_1, \ldots, i_k) + [v(i_1, \ldots, i_{\ell+1}) - v(i_1, \ldots, i_\ell)] \\
&\quad + [v(i_1, \ldots, i_{\ell+2}) - v(i_1, \ldots, i_{\ell+1})] + \ldots + [v(i_1, \ldots, i_s) - v(i_1, \ldots, i_{s-1})] \\
&= v(S \cap T) + v(S \cup T) - v(T),
\end{aligned}
$$

which implies (18.2). So $v$ is a convex game. ■

An  immediate  consequence  of  Theorem  18.6  and  the  definition  of  the  Shapley  value 
(Definition  17.1)  is  the  following  corollary. 

Corollary 18.7 Let $v \in \mathcal{G}^N$ be convex. Then $\Phi(v) \in C(v)$, i.e., the Shapley value is in the core.
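Theorem 18.6 and Corollary 18.7 can be illustrated by checking core membership of all marginal vectors. The Python sketch below is a brute-force illustration under stated assumptions: the function names `marginal_vector`, `in_core`, and `shapley` are hypothetical, the convex game is the one of Table 17.2, and the non-convex game is the one of Example 18.2.

```python
from fractions import Fraction
from itertools import combinations, permutations


def marginal_vector(order, v):
    """Each player gets the worth he adds when entering in the given order."""
    payoff, entered = {}, frozenset()
    for i in order:
        payoff[i] = v[entered | {i}] - v[entered]
        entered = entered | {i}
    return payoff


def in_core(x, players, v):
    """Efficiency plus x(S) >= v(S) for every nonempty proper coalition S."""
    N = frozenset(players)
    if sum(x[i] for i in N) != v[N]:
        return False
    items = sorted(N)
    return all(sum(x[i] for i in c) >= v[frozenset(c)]
               for r in range(1, len(items))
               for c in combinations(items, r))


def shapley(players, v):
    """Shapley value as the average of all marginal vectors."""
    orders = list(permutations(sorted(players)))
    return {i: sum(Fraction(marginal_vector(o, v)[i]) for o in orders) / len(orders)
            for i in players}


N = {1, 2, 3}
v_convex = {frozenset(): 0,
            frozenset({1}): 1, frozenset({2}): 2, frozenset({3}): 3,
            frozenset({1, 2}): 5, frozenset({1, 3}): 6, frozenset({2, 3}): 9,
            frozenset({1, 2, 3}): 15}
v_other = {frozenset(): 0,
           frozenset({1}): 0, frozenset({2}): 0, frozenset({3}): 0,
           frozenset({1, 2}): 1, frozenset({1, 3}): 1, frozenset({2, 3}): -1,
           frozenset({1, 2, 3}): 3}

all_in_core_convex = all(in_core(marginal_vector(o, v_convex), N, v_convex)
                         for o in permutations(sorted(N)))
all_in_core_other = all(in_core(marginal_vector(o, v_other), N, v_other)
                        for o in permutations(sorted(N)))
shapley_in_core = in_core(shapley(N, v_convex), N, v_convex)
print(all_in_core_convex, all_in_core_other, shapley_in_core)
```

For the game of Example 18.2 the marginal vector $(4, 0, -1)$ gives player 3 less than $v(3) = 0$, so it lies outside the core, in line with the game not being convex.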


18.3 Random Order Values


A value $\psi : \mathcal{G}^N \to \mathbb{R}^N$ is called a random order value if there is a probability distribution $p$ over the set of permutations $\Pi(N)$ of $N$ such that

$$\psi(N, v) = \sum_{\pi \in \Pi(N)} p(\pi)\, m^\pi(N, v)$$

for every $(N, v) \in \mathcal{G}^N$. In that case, we denote $\psi$ by $\Phi^p$. Observe that $\Phi^p$ is the Shapley value $\Phi$ if $p(\pi) = 1/n!$ for every $\pi \in \Pi(N)$. Obviously,

$$W(v) = \{x \in \mathbb{R}^N \mid x = \Phi^p(v) \text{ for some } p\}.$$
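A random order value is simply an expected marginal vector, which the following Python sketch makes concrete. The function names `marginal_vector` and `random_order_value` and the encoding of $p$ as a dictionary from orders to probabilities are assumptions for illustration; the game is that of Example 18.2.

```python
from fractions import Fraction
from itertools import permutations


def marginal_vector(order, v):
    """Each player gets the worth he adds when entering in the given order."""
    payoff, entered = {}, frozenset()
    for i in order:
        payoff[i] = v[entered | {i}] - v[entered]
        entered = entered | {i}
    return payoff


def random_order_value(players, v, p):
    """Expected marginal vector when order `o` is drawn with probability p[o]."""
    value = {i: Fraction(0) for i in players}
    for order, prob in p.items():
        m = marginal_vector(order, v)
        for i in players:
            value[i] += prob * m[i]
    return value


# Game of Example 18.2.
v = {frozenset(): 0,
     frozenset({1}): 0, frozenset({2}): 0, frozenset({3}): 0,
     frozenset({1, 2}): 1, frozenset({1, 3}): 1, frozenset({2, 3}): -1,
     frozenset({1, 2, 3}): 3}

# Uniform distribution over the 3! orders gives the Shapley value.
uniform = {o: Fraction(1, 6) for o in permutations((1, 2, 3))}
phi_uniform = random_order_value({1, 2, 3}, v, uniform)

# A distribution concentrated on one order returns that order's marginal vector.
point_mass = {(2, 3, 1): Fraction(1)}
phi_point = random_order_value({1, 2, 3}, v, point_mass)
print(phi_uniform, phi_point)
```

Varying $p$ between such point masses traces out the convex hull of the marginal vectors, which is the description of $W(v)$ given above.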


Random  order  values  satisfy  the  following  two  conditions. 

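Monotonicity (MON): $\psi(v) \ge 0$ for all monotonic games $v \in \mathcal{G}^N$. [The game $v$ is monotonic if $S \subseteq T$ implies $v(S) \le v(T)$ for all $S, T \subseteq N$.]

Linearity (LIN): $\psi(\alpha v + \beta w) = \alpha \psi(v) + \beta \psi(w)$ for all $v, w \in \mathcal{G}^N$, and $\alpha, \beta \in \mathbb{R}$ [where, for each $S$, $(\alpha v + \beta w)(S) = \alpha v(S) + \beta w(S)$].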

Monotonicity  says  that  in  a  monotonic  game,  where  larger  coalitions  have  higher 
worths,  i.e.,  all  marginal  contributions  are  nonnegative,  every  player  should  receive  a 
nonnegative  payoff.  Linearity  is  a  strengthening  of  additivity.  The  main  result  in  this 
section  is  the  following  characterization  of  random  order  values.  (See  Problem  18.7 
for  a  strengthening  of  the  ‘only  if’  part  of  this  theorem.) 

Theorem 18.8 Let $\psi : \mathcal{G}^N \to \mathbb{R}^N$ be a value. Then $\psi$ satisfies LIN, DUM, EFF, and MON if and only if it is a random order value.

The proof of Theorem 18.8 is based on a sequence of propositions and lemmas, which are of independent interest.


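Proposition 18.9 Let $\psi : \mathcal{G}^N \to \mathbb{R}^N$ be a linear value. Then there is a collection of constants $\{a^i_T \in \mathbb{R} \mid i \in N,\ \emptyset \neq T \subseteq N\}$ such that $\psi_i(v) = \sum_{\emptyset \neq T \subseteq N} a^i_T v(T)$ for every $v \in \mathcal{G}^N$ and $i \in N$.

Proof Let $a^i_T := \psi_i(1_T)$ for all $i \in N$ and $\emptyset \neq T \subseteq N$ (cf. Problem 17.1). For every $v \in \mathcal{G}^N$ we have $v = \sum_{\emptyset \neq T \subseteq N} v(T) 1_T$. The desired conclusion follows from linearity of $\psi$. ■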

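Proposition 18.10 Let $\psi : \mathcal{G}^N \to \mathbb{R}^N$ be a linear value satisfying DUM. Then there
is a collection of constants $\{p_T^i \in \mathbb{R} \mid i \in N,\ T \subseteq N \setminus i\}$ with $\sum_{T \subseteq N \setminus i} p_T^i = 1$ for all
$i \in N$, such that for every $v \in \mathcal{G}^N$ and every $i \in N$:

$$\psi_i(v) = \sum_{T \subseteq N \setminus i} p_T^i\, [v(T \cup i) - v(T)].$$


Proof Let $v \in \mathcal{G}^N$ and $i \in N$. By Proposition 18.9 there are $a_T^i$ such that $\psi_i(v) =
\sum_{\emptyset \ne T \subseteq N} a_T^i\, v(T)$. Then $0 = \psi_i(u_{N \setminus i}) = a_N^i + a_{N \setminus i}^i$, where the first equality follows
from DUM. Assume now as induction hypothesis that $a_{T \cup i}^i + a_T^i = 0$ for all $T \subseteq N \setminus i$
with $|T| \ge k \ge 2$ (we have just established this for $k = n - 1$), and let $S \subseteq N \setminus i$
with $|S| = k - 1$. Then

$$0 = \psi_i(u_S) = \sum_{T:\, S \subseteq T} a_T^i = \sum_{T:\, S \subsetneq T \subseteq N \setminus i} (a_{T \cup i}^i + a_T^i) + a_{S \cup i}^i + a_S^i = a_{S \cup i}^i + a_S^i,$$

where the last equality follows by induction and the first one by DUM. Hence, we
have proved that $a_{T \cup i}^i + a_T^i = 0$ for all $T \subseteq N \setminus i$ with $0 < |T| \le n - 1$. Now define,
for all $i \in N$ and all such $T$, $p_T^i := a_{T \cup i}^i = -a_T^i$, and define $p_\emptyset^i := a_{\{i\}}^i$. Then for
every $v \in \mathcal{G}^N$ and $i \in N$:

$$\begin{aligned}
\psi_i(v) &= \sum_{\emptyset \ne T \subseteq N} a_T^i\, v(T) \\
&= \sum_{T:\, i \in T} a_T^i\, v(T) + \sum_{\emptyset \ne T \subseteq N \setminus i} a_T^i\, v(T) \\
&= \sum_{T:\, i \notin T} a_{T \cup i}^i\, v(T \cup i) - \sum_{\emptyset \ne T \subseteq N \setminus i} (-a_T^i)\, v(T) \\
&= \sum_{T \subseteq N \setminus i} p_T^i\, [v(T \cup i) - v(T)],
\end{aligned}$$

where the last line uses $v(\emptyset) = 0$. Finally, by DUM,

$$1 = u_{\{i\}}(i) = \psi_i(u_{\{i\}}) = \sum_{T \subseteq N \setminus i} p_T^i,$$

which completes the proof of the proposition. ■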

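Proposition 18.11 Let $\psi : \mathcal{G}^N \to \mathbb{R}^N$ be a linear value satisfying DUM and MON.
Then there is a collection of constants $\{p_T^i \in \mathbb{R} \mid i \in N,\ T \subseteq N \setminus i\}$ with
$\sum_{T \subseteq N \setminus i} p_T^i = 1$ and $p_S^i \ge 0$ for all $S \subseteq N \setminus i$ and $i \in N$, such that for every
$v \in \mathcal{G}^N$ and every $i \in N$:

$$\psi_i(v) = \sum_{T \subseteq N \setminus i} p_T^i\, [v(T \cup i) - v(T)].$$


Proof In view of Proposition 18.10 we only have to prove that the weights $p_T^i$ are
nonnegative. Let $i \in N$ and $T \subseteq N \setminus i$ and consider the game $\hat{u}_T$ assigning worth 1
to all strict supersets of $T$ and 0 otherwise. Then $\psi_i(\hat{u}_T) = \sum_{S \subseteq N \setminus i} p_S^i\, [\hat{u}_T(S \cup i) -
\hat{u}_T(S)]$ by Proposition 18.10, hence $\psi_i(\hat{u}_T) = p_T^i$. Since $\hat{u}_T$ is a monotonic game,
this implies $p_T^i \ge 0$. ■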


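Lemma 18.12 Let $\psi : \mathcal{G}^N \to \mathbb{R}^N$ be a value and $\{p_T^i \in \mathbb{R} \mid i \in N,\ T \subseteq N \setminus i\}$ be a
collection of constants such that for every $v \in \mathcal{G}^N$ and every $i \in N$:

$$\psi_i(v) = \sum_{T \subseteq N \setminus i} p_T^i\, [v(T \cup i) - v(T)].$$

Then $\psi$ is efficient if and only if $\sum_{i \in N} p_{N \setminus i}^i = 1$ and $\sum_{i \in T} p_{T \setminus i}^i = \sum_{j \in N \setminus T} p_T^j$ for all
$\emptyset \ne T \ne N$.

Proof Let $v \in \mathcal{G}^N$. Then

$$\begin{aligned}
\psi(v)(N) &= \sum_{i \in N} \sum_{T \subseteq N \setminus i} p_T^i\, [v(T \cup i) - v(T)] \\
&= \sum_{i \in N} \Big( \sum_{T \subseteq N:\, i \in T} p_{T \setminus i}^i\, v(T) - \sum_{T \subseteq N:\, i \notin T} p_T^i\, v(T) \Big) \\
&= \sum_{T \subseteq N} v(T) \Big( \sum_{i \in T} p_{T \setminus i}^i - \sum_{j \in N \setminus T} p_T^j \Big).
\end{aligned}$$

Clearly, this implies efficiency of $\psi$ if the relations in the lemma hold. Conversely,
suppose that $\psi$ is efficient. Let $\emptyset \ne T \subseteq N$ and consider the games $u_T$ and $\hat{u}_T$,
where $\hat{u}_T$ was defined in the proof of Proposition 18.11. Then the preceding equation
implies that

$$\begin{aligned}
\psi(u_T)(N) - \psi(\hat{u}_T)(N) &= \sum_{S \subseteq N} \big(u_T(S) - \hat{u}_T(S)\big) \Big( \sum_{i \in S} p_{S \setminus i}^i - \sum_{j \in N \setminus S} p_S^j \Big) \\
&= \sum_{i \in T} p_{T \setminus i}^i - \sum_{j \in N \setminus T} p_T^j.
\end{aligned}$$

The relations in the lemma now follow by efficiency of $\psi$, since $u_T(N) - \hat{u}_T(N)$ is
equal to 1 if $T = N$ and equal to 0 otherwise. ■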
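The two conditions of Lemma 18.12 can be checked numerically. The sketch below is an illustration only; the function names and the example distribution are hypothetical. It derives weights from a distribution over permutations, by taking for each player i and each set T the total probability of the orders in which T is exactly the set of predecessors of i, and then tests the two relations of the lemma.

```python
from fractions import Fraction
from itertools import combinations

def weights_from_distribution(p):
    """Map (i, T) to the total probability of the orders in which T is
    exactly the predecessor set of player i."""
    w = {}
    for order, prob in p.items():
        preds = frozenset()
        for i in order:
            w[(i, preds)] = w.get((i, preds), Fraction(0)) + prob
            preds = preds | {i}
    return w

def efficiency_conditions_hold(players, w):
    """Check the two relations of Lemma 18.12 for the weights w."""
    everyone = frozenset(players)
    for size in range(1, len(players)):
        for T in map(frozenset, combinations(players, size)):
            inflow = sum(w.get((i, T - {i}), 0) for i in T)
            outflow = sum(w.get((j, T), 0) for j in players if j not in T)
            if inflow != outflow:
                return False
    return sum(w.get((i, everyone - {i}), 0) for i in players) == 1

players = (1, 2, 3)
# Hypothetical distribution over orders of three players.
p0 = {(1, 2, 3): Fraction(1, 2), (3, 2, 1): Fraction(1, 3), (2, 1, 3): Fraction(1, 6)}
w = weights_from_distribution(p0)
ok = efficiency_conditions_hold(players, w)

# Perturbing one weight breaks the balance, hence efficiency.
w_bad = dict(w)
w_bad[(2, frozenset({1}))] += Fraction(1, 10)
bad = efficiency_conditions_hold(players, w_bad)
```

Weights obtained this way always pass the check, since the probability mass that reaches a coalition T must also leave it.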

We  are  now  sufficiently  equipped  to  prove  Theorem  18.8. 

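Proof of Theorem 18.8 We leave it to the reader to verify that random order values
satisfy the four axioms in the theorem. Conversely, let $\psi$ satisfy these four axioms.
By Proposition 18.11 there is a collection of constants $\{p_T^i \in \mathbb{R} \mid i \in N,\ T \subseteq N \setminus i\}$
with $\sum_{T \subseteq N \setminus i} p_T^i = 1$ and $p_S^i \ge 0$ for all $S \subseteq N \setminus i$ and $i \in N$, such that for every
$v \in \mathcal{G}^N$ and every $i \in N$:

$$\psi_i(v) = \sum_{T \subseteq N \setminus i} p_T^i\, [v(T \cup i) - v(T)].$$

For all $i \in N$ and $T \subseteq N \setminus i$ define $A(T) := \sum_{j \in N \setminus T} p_T^j$ and $A(i; T) := p_T^i / A(T)$
if $A(T) \ne 0$, $A(i; T) := 0$ if $A(T) = 0$. For any permutation $\pi \in \Pi(N)$ write
$\pi = (i_1, \dots, i_n)$ (that is, $\pi(k) = i_k$ for all $k \in N$). Define

$$p(\pi) = p_\emptyset^{i_1}\, A(i_2; \{i_1\})\, A(i_3; \{i_1, i_2\}) \cdots A(i_n; \{i_1, \dots, i_{n-1}\}).$$

Then we claim that

$$\sum_{\pi \in \Pi(N)} p(\pi) = \sum_{i \in N} p_\emptyset^i. \tag{18.4}$$

In order to prove (18.4), note that when considering the sum $\sum_{\pi \in \Pi(N)} p(\pi)$ we may
leave out any $\pi$ with $p(\pi) = 0$. This means in particular that in what follows all
expressions $A(\cdot\,; \cdot)$ are positive. Now

$$\begin{aligned}
\sum_{\pi \in \Pi(N)} p(\pi)
&= \sum_{i_1 \in N} \sum_{i_2 \in N \setminus i_1} \cdots \sum_{i_{n-1} \in N \setminus i_1 \cdots i_{n-2}} p_\emptyset^{i_1} A(i_2; \{i_1\}) \cdot A(i_3; \{i_1, i_2\}) \cdots A(i_{n-1}; \{i_1, \dots, i_{n-2}\}) \cdot A(i_n; \{i_1, \dots, i_{n-1}\}) \\
&= \sum_{i_1 \in N} \sum_{i_2 \in N \setminus i_1} \cdots \sum_{i_{n-2} \in N \setminus i_1 \cdots i_{n-3}} p_\emptyset^{i_1} A(i_2; \{i_1\}) \cdot A(i_3; \{i_1, i_2\}) \cdots A(i_{n-2}; \{i_1, \dots, i_{n-3}\}) \\
&\qquad \cdot \left( \frac{p^{l}_{\{i_1, \dots, i_{n-2}\}}}{p^{l}_{\{i_1, \dots, i_{n-2}\}} + p^{k}_{\{i_1, \dots, i_{n-2}\}}} + \frac{p^{k}_{\{i_1, \dots, i_{n-2}\}}}{p^{l}_{\{i_1, \dots, i_{n-2}\}} + p^{k}_{\{i_1, \dots, i_{n-2}\}}} \right) \\
&= \sum_{i_1 \in N} \sum_{i_2 \in N \setminus i_1} \cdots \sum_{i_{n-2} \in N \setminus i_1 \cdots i_{n-3}} p_\emptyset^{i_1} A(i_2; \{i_1\}) \cdot A(i_3; \{i_1, i_2\}) \cdots A(i_{n-2}; \{i_1, \dots, i_{n-3}\}) \\
&= \sum_{i_1 \in N} \sum_{i_2 \in N \setminus i_1} \cdots \sum_{i_{n-3} \in N \setminus i_1 \cdots i_{n-4}} p_\emptyset^{i_1} A(i_2; \{i_1\}) \cdot A(i_3; \{i_1, i_2\}) \cdots A(i_{n-3}; \{i_1, \dots, i_{n-4}\}) \\
&= \cdots = \sum_{i_1 \in N} p_\emptyset^{i_1},
\end{aligned}$$

where after the first equality sign, $i_n \in N \setminus \{i_1, \dots, i_{n-1}\}$, and $A(i_n; \{i_1, \dots, i_{n-1}\})
= 1$ by definition; after the second equality sign $l, k \in N \setminus \{i_1, \dots, i_{n-2}\}$ with
$l \ne k$; the third equality sign follows since the sum involving $l$ and $k$ is equal to 1;
the remaining equality signs follow from repetition of this argument. This concludes
the proof of (18.4).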

We next claim that for every $0 \le t \le n - 1$ we have

$$\sum_{T:\, |T| = t}\ \sum_{i \in N \setminus T} p_T^i = 1. \tag{18.5}$$

To prove this, first let $t = n - 1$. Then the sum on the left-hand side of (18.5) is equal
to $\sum_{i \in N} p_{N \setminus i}^i$, which is equal to 1 by Lemma 18.12. Now as induction hypothesis
assume that (18.5) holds for $t + 1$. Then

$$\begin{aligned}
\sum_{T:\, |T| = t}\ \sum_{i \in N \setminus T} p_T^i
&= \sum_{i \in N}\ \sum_{T:\, |T| = t+1,\ i \in T} p_{T \setminus i}^i \\
&= \sum_{T:\, |T| = t+1}\ \sum_{i \in T} p_{T \setminus i}^i \\
&= \sum_{T:\, |T| = t+1}\ \sum_{i \in N \setminus T} p_T^i \\
&= 1,
\end{aligned}$$

where the step from the inner sum over $i \in T$ to the inner sum over $i \in N \setminus T$ follows
by Lemma 18.12 and the last equality by induction. This proves (18.5). In particular,
for $t = 0$, we have $\sum_{i \in N} p_\emptyset^i = 1$. Together with (18.4) this implies that $p(\cdot)$ as
defined above is a probability distribution on $\Pi(N)$.

In order to complete the proof of the theorem, it is sufficient to show that $\psi =
\Phi^p$. For every game $v \in \mathcal{G}^N$ and $i \in N$ we can write

$$\Phi_i^p(v) = \sum_{\pi \in \Pi(N)} p(\pi)\, [v(P_\pi(i) \cup i) - v(P_\pi(i))],$$

where $P_\pi(i)$ denotes the set of predecessors of player $i$ under the permutation $\pi$ (cf.
Sect. 17.1). Hence, it is sufficient to prove that for all $i \in N$ and $T \subseteq N \setminus i$ we have

$$p_T^i = \sum_{\pi \in \Pi(N):\ T = P_\pi(i)} p(\pi). \tag{18.6}$$

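In order to prove (18.6), first let $|T| = t$. By using a similar argument as for the
proof of (18.4), we can write

$$\sum_{\pi:\ T = P_\pi(i)} p(\pi) = \sum_{i_1 \in T}\ \sum_{i_2 \in T \setminus i_1} \cdots \sum_{i_t \in T \setminus i_1 i_2 \cdots i_{t-1}} p_\emptyset^{i_1}\, A(i_2; \{i_1\}) \cdots A(i; T).$$

Hence,

$$\begin{aligned}
\sum_{\pi:\ T = P_\pi(i)} p(\pi)
&= \frac{p_T^i}{\sum_{j \in N \setminus T} p_T^j} \cdot \sum_{i_t \in T} \frac{p_{T \setminus i_t}^{i_t}}{\sum_{j \in (N \setminus T) \cup i_t} p_{T \setminus i_t}^j} \cdot \sum_{i_{t-1} \in T \setminus i_t} \frac{p_{T \setminus i_t i_{t-1}}^{i_{t-1}}}{\sum_{j \in (N \setminus T) \cup i_t i_{t-1}} p_{T \setminus i_t i_{t-1}}^j} \cdots \sum_{i_1 \in T \setminus i_t \cdots i_2} p_\emptyset^{i_1} \\
&= \frac{p_T^i}{\sum_{j \in T} p_{T \setminus j}^j} \cdot \sum_{i_t \in T} \frac{p_{T \setminus i_t}^{i_t}}{\sum_{j \in T \setminus i_t} p_{T \setminus i_t j}^j} \cdot \sum_{i_{t-1} \in T \setminus i_t} \frac{p_{T \setminus i_t i_{t-1}}^{i_{t-1}}}{\sum_{j \in T \setminus i_t i_{t-1}} p_{T \setminus i_t i_{t-1} j}^j} \cdots \sum_{i_1 \in T \setminus i_t \cdots i_2} p_\emptyset^{i_1} \\
&= p_T^i.
\end{aligned}$$

Here, the first equality sign follows from rearranging terms and substituting the
expressions for $A(\cdot\,; \cdot)$; the second equality sign follows from Lemma 18.12; the
final equality sign follows from reading the preceding expression from right to left,
noting that the remaining sum in each numerator cancels against the preceding
denominator. ■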
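The construction of $p(\pi)$ in the proof can be tried out on a familiar case. The sketch below is an illustration, not part of the proof. It assumes the standard Shapley weights, which depend only on the size of T and equal |T|!(n-1-|T|)!/n!, feeds them into the product formula for p(π), and checks that the result is the uniform distribution over permutations. The helper names are hypothetical.

```python
from fractions import Fraction
from itertools import permutations
from math import factorial

def shapley_weight(t, n):
    """Weight of the marginal contribution to a coalition of size t
    in the Shapley value: t!(n-1-t)!/n!."""
    return Fraction(factorial(t) * factorial(n - 1 - t), factorial(n))

def distribution_from_weights(players, weight):
    """Build p(pi) as in the proof: the weight of the first player at the
    empty set, times A(i;T) = weight(i,T) / sum_{j not in T} weight(j,T)
    for every later player i with predecessor set T."""
    p = {}
    for order in permutations(players):
        prob, preds = Fraction(1), frozenset()
        for k, i in enumerate(order):
            if k == 0:
                prob *= weight(i, preds)
            else:
                a_T = sum(weight(j, preds) for j in players if j not in preds)
                prob *= weight(i, preds) / a_T if a_T != 0 else 0
            preds = preds | {i}
        p[order] = prob
    return p

players = (1, 2, 3, 4)
n = len(players)
p = distribution_from_weights(players, lambda i, T: shapley_weight(len(T), n))
```

With these symmetric weights every factor A(i;T) equals 1/(n-|T|), so each of the 24 orders receives probability 1/24, in line with the remark that the Shapley value corresponds to the uniform distribution.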


18.4 Weighted Shapley Values

The  Shapley  value  is  a  random  order  value  that  distributes  the  dividend  of  each 
coalition  equally  among  all  the  members  of  that  coalition  (see  Theorem  17.7).  In  this 
sense,  it  treats  players  consistently  over  coalitions.  This  is  not  necessarily  the  case 
for every random order value. To be specific, consider Example 18.2. The payoff
vector $(2\tfrac{1}{2}, -\tfrac{1}{2}, 1)$ is a point of the Weber set, namely the midpoint of the marginal
vectors $(4, -1, 0)$ and $(1, 0, 2)$. Thus, it can be obtained uniquely as $\Phi^p(v)$, where
the probability distribution $p$ assigns weights 1/2 to the permutations (3, 2, 1) and
(2, 1, 3). In terms of dividends, the two marginal vectors can be written as

$$(\Delta_v(1) + \Delta_v(12) + \Delta_v(13) + \Delta_v(123),\ \Delta_v(2) + \Delta_v(23),\ \Delta_v(3))$$

for the permutation (3, 2, 1) and

$$(\Delta_v(1) + \Delta_v(12),\ \Delta_v(2),\ \Delta_v(3) + \Delta_v(13) + \Delta_v(23) + \Delta_v(123))$$

for the permutation (2, 1, 3). (Cf. Problem 18.1, where this is generalized.) Thus,
$\Delta_v(123)$ is split equally between players 1 and 3, whereas $\Delta_v(23)$ is split equally
between players 2 and 3. Hence, whereas player 2 has zero power compared to
player 3 in distributing $\Delta_v(123)$, they have equal power in distributing $\Delta_v(23)$. In
this respect, players 2 and 3 are not treated consistently by $\Phi^p$.

In order to formalize the idea of consistent treatment, we first define positively
weighted Shapley values. Let $\omega \in \mathbb{R}^N$ with $\omega > 0$. The positively weighted Shapley
value $\Phi^\omega$ is defined as the unique linear value which assigns to each unanimity game
$(N, u_S)$:

$$\Phi_i^\omega(u_S) = \begin{cases} \omega_i / \omega(S) & \text{for } i \in S \\ 0 & \text{for } i \in N \setminus S. \end{cases} \tag{18.7}$$


We  will  show  that  these  positively  weighted  Shapley  values  are  random  order  values. 
Define independently distributed random variables $X_i$ ($i \in N$) on $[0, 1]$ by their
cumulative distribution functions $[0, 1] \ni t \mapsto t^{\omega_i}$ (that is, $X_i \le t$ with probability
$t^{\omega_i}$). Then, define the probability distribution $p^\omega$ by

$$p^\omega(\pi) = \int_0^1 \int_0^{t_n} \int_0^{t_{n-1}} \cdots \int_0^{t_2} dt_1^{\omega_{i_1}}\, dt_2^{\omega_{i_2}} \cdots dt_{n-1}^{\omega_{i_{n-1}}}\, dt_n^{\omega_{i_n}} \tag{18.8}$$

for every permutation $\pi = (i_1, i_2, \dots, i_n)$. That is, $p^\omega(\pi)$ is defined as the
probability that $i_1$ comes before $i_2$, $i_2$ before $i_3$, etc., evaluated according to the
independent random variables $X_i$. It is straightforward to check that, indeed, $p^\omega$ is a
probability distribution over the set of permutations. Then we have:
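The integral (18.8) can be evaluated from the inside out: integrating over $t_1$ gives $t_2^{\omega_{i_1}}$, integrating over $t_2$ gives $\frac{\omega_{i_2}}{\omega_{i_1} + \omega_{i_2}}\, t_3^{\omega_{i_1} + \omega_{i_2}}$, and so on, so that $p^\omega(\pi)$ is the product over $k$ of $\omega_{i_k} / (\omega_{i_1} + \dots + \omega_{i_k})$. This closed form is derived here and is not stated in the text. The sketch below assumes it, together with hypothetical weights, to check that $p^\omega$ sums to one and that the induced value of a unanimity game $u_S$ gives player $i \in S$ the share $\omega_i / \omega(S)$.

```python
from fractions import Fraction
from itertools import permutations

def p_omega(order, omega):
    """Closed form of (18.8), assumed here: the product over k of
    omega[i_k] / (omega[i_1] + ... + omega[i_k])."""
    prob, running = Fraction(1), Fraction(0)
    for i in order:
        running += omega[i]
        prob *= Fraction(omega[i]) / running
    return prob

def value_on_unanimity_game(S, i, players, omega):
    """Payoff of player i in u_S under p_omega: the total probability of
    the orders in which i enters after all other members of S."""
    if i not in S:
        return Fraction(0)
    total = Fraction(0)
    for order in permutations(players):
        if all(order.index(j) <= order.index(i) for j in S):
            total += p_omega(order, omega)
    return total

players = (1, 2, 3)
omega = {1: 1, 2: 2, 3: 3}  # hypothetical positive integer weights

total_probability = sum(p_omega(pi, omega) for pi in permutations(players))
share_3_in_13 = value_on_unanimity_game({1, 3}, 3, players, omega)
share_2_in_N = value_on_unanimity_game({1, 2, 3}, 2, players, omega)
```

Players with larger weights tend to enter later and therefore collect the marginal contribution of a unanimity game more often, which is how the weights translate into shares.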


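Theorem 18.13 For every $\omega \in \mathbb{R}^N$ with $\omega > 0$, $\Phi^\omega = \Phi^{p^\omega}$.


Proof Let $S$ be a nonempty coalition and $i \in S$. Since $\Phi^\omega$ and $\Phi^{p^\omega}$ are linear, it is
sufficient to prove that $\Phi_i^{p^\omega}(u_S) = \omega_i / \omega(S)$. Note that

$$\Phi_i^{p^\omega}(u_S) = \sum_{\pi \in \Pi(N)} p^\omega(\pi)\, m_i^\pi(u_S) = \sum_{\pi \in \Pi(N):\ S \setminus i \subseteq P_\pi(i)} p^\omega(\pi),$$

and the right-hand side of this identity is equal to

$$\int_0^1 \int_0^{t_i} dt^{\omega(S \setminus i)}\, dt_i^{\omega_i}.$$

Hence,

$$\Phi_i^{p^\omega}(u_S) = \int_0^1 \int_0^{t_i} dt^{\omega(S \setminus i)}\, dt_i^{\omega_i} = \int_0^1 t_i^{\omega(S \setminus i)}\, dt_i^{\omega_i} = \int_0^1 \omega_i\, t_i^{\omega(S) - 1}\, dt_i = \omega_i / \omega(S),$$

which concludes the proof of the theorem.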


Next,  we  extend  the  concept  of  weighted  Shapley  value  to  include  zero  weights. 
Consider for instance, the three-person random order value that puts weight 1/2 on
the permutations (1, 2, 3) and (1, 3, 2). Then each of the marginal vectors

$$(\Delta_v(1),\ \Delta_v(2) + \Delta_v(12),\ \Delta_v(3) + \Delta_v(13) + \Delta_v(23) + \Delta_v(123))$$

and

$$(\Delta_v(1),\ \Delta_v(2) + \Delta_v(12) + \Delta_v(23) + \Delta_v(123),\ \Delta_v(3) + \Delta_v(13))$$

gets weight 1/2. (Cf. again Problem 18.1.) The dividend $\Delta_v(12)$ goes to player
2, the dividend $\Delta_v(13)$ to player 3, and the dividends $\Delta_v(23)$ and $\Delta_v(123)$ are
split equally between players 2 and 3. Thus, this random order value treats players
consistently but we cannot just formalize this by giving player 1 weight 0 since
player 1 does obtain $\Delta_v(1)$.

To  accommodate  this  kind  of  random  order  values  we  introduce  the  concept  of  a 
weight system. A weight system $w$ is an ordered partition $(S_1, \dots, S_k)$ of $N$ together
with a vector $\omega \in \mathbb{R}^N$ such that $\omega > 0$. The weighted Shapley value $\Phi^w$ is defined
as the unique linear value which assigns to each unanimity game $u_S \in \mathcal{G}^N$, $S \ne \emptyset$:

$$\Phi_i^w(u_S) = \begin{cases} \omega_i / \omega(S \cap S_m) & \text{for } i \in S \cap S_m \text{ and } m = \max\{h : S_h \cap S \ne \emptyset\} \\ 0 & \text{otherwise.} \end{cases} \tag{18.9}$$

Hence, $S_h$ is more powerful as $h$ is larger; for each coalition $S$ we consider the subset
of the most powerful players $S \cap S_m$, where $m$ is the largest index $h$ such that the
intersection of $S_h$ with $S$ is nonempty, and they distribute the dividend of coalition
$S$ according to their (relative) weights $\omega_i / \omega(S \cap S_m)$. Clearly, for $k = 1$ we obtain a
positively weighted Shapley value as defined above.
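Since every game is a linear combination of unanimity games with the dividends as coefficients, (18.9) translates directly into a rule for sharing dividends. The sketch below illustrates this under stated assumptions: the game `v` is hypothetical, the weights are integers, and the function names are made up for the example. With the ordered partition ({1}, {2, 3}) and equal weights it reproduces the random order value that puts weight 1/2 on the orders (1, 2, 3) and (1, 3, 2).

```python
from fractions import Fraction
from itertools import combinations

def dividends(v, players):
    """Dividends computed recursively: v(S) minus the dividends of all
    nonempty proper subcoalitions of S."""
    delta = {}
    for size in range(1, len(players) + 1):
        for S in map(frozenset, combinations(players, size)):
            delta[S] = v[S] - sum(d for T, d in delta.items() if T < S)
    return delta

def weighted_shapley(v, players, partition, omega):
    """Share each dividend as in (18.9): only the members of S in the
    highest-indexed class meeting S receive it, in proportion to omega."""
    value = {i: Fraction(0) for i in players}
    for S, d in dividends(v, players).items():
        top = [cls & S for cls in partition if cls & S][-1]
        weight_sum = sum(omega[j] for j in top)
        for i in top:
            value[i] += Fraction(omega[i], weight_sum) * d
    return value

players = (1, 2, 3)
# Hypothetical game, chosen only for illustration.
v = {
    frozenset({1}): 4, frozenset({2}): 4, frozenset({3}): 4,
    frozenset({1, 2}): 8, frozenset({1, 3}): 12, frozenset({2, 3}): 16,
    frozenset({1, 2, 3}): 24,
}
ones = {1: 1, 2: 1, 3: 1}

# Player 1 in the weakest class, players 2 and 3 in the strongest.
two_classes = weighted_shapley(v, players, [{1}, {2, 3}], ones)
# A single class with equal weights gives the ordinary Shapley value.
one_class = weighted_shapley(v, players, [{1, 2, 3}], ones)
```

In this game the marginal vectors for the orders (1, 2, 3) and (1, 3, 2) are (4, 4, 16) and (4, 12, 8), and their midpoint (4, 8, 12) is what `two_classes` contains.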

Weighted Shapley values are again random order values. For a weight system
$w$ with ordered partition $(S_1, \dots, S_k)$ we only assign positive probability to those
permutations in which all players of $S_1$ enter before all players of $S_2$, all players
of $S_2$ enter before all players of $S_3$, etc. Given such a permutation we can assign
probability $p_1(\pi)$ to the order induced by $\pi$ on $S_1$ in the same way as we did above in
Eq. (18.8); similarly, we assign probabilities $p_2(\pi), \dots, p_k(\pi)$ to the orders induced
on $S_2, \dots, S_k$, respectively. Then we define

$$p^w(\pi) = \prod_{h=1}^{k} p_h(\pi).$$

It can be shown again that $\Phi^w = \Phi^{p^w}$.

We  conclude  with  an  axiomatic  characterization  of  weighted  Shapley  values.  To 
this end we consider the following axiom for a value $\psi : \mathcal{G}^N \to \mathbb{R}^N$.

Partnership (PA): $\psi_i(\psi(u_T)(S)\, u_S) = \psi_i(u_T)$ for all $S \subseteq T \subseteq N$ and all $i \in S$.

Theorem 18.14 Let $\psi : \mathcal{G}^N \to \mathbb{R}^N$ be a value. Then $\psi$ satisfies LIN, DUM, EFF,
MON, and PA, if and only if it is a weighted Shapley value.


Proof Problem 18.8.

By Theorem 18.3 we know that the core of any game is included in the Weber set
and, thus, in any game any core element corresponds to at least one random order
value. The following theorem, included here without a proof, states that, in fact, the
core is always covered by the set of weighted Shapley values.

Theorem 18.15 Let $v \in \mathcal{G}^N$ and $x \in C(v)$. Then there is a weight system $w$ such
that $x = \Phi^w(v)$.


18.5  Problems 

18.1.  Marginal  Vectors  and  Dividends 
Let  (N,  v)  e  QN . 

(a)  Show  that 


$$v(S) = \sum_{T \subseteq S} \Delta_v(T) \tag{18.10}$$

where $\Delta_v(T)$ are the dividends defined in Sect. 17.1.

(b) Express each marginal vector $m^\pi$ in terms of dividends.

18.2.  Convexity  and  Marginal  Vectors 

Prove that a game $(N, v)$ is convex if and only if for all $T \in 2^N \setminus \{\emptyset\}$:

$$v(T) = \min_{\pi \in \Pi(N)} \sum_{i \in T} m_i^\pi(v).$$


18.3.  Strictly  Convex  Games 

Call  a  game  (N,  v )  strictly  convex  if  all  inequalities  in  (18.3)  hold  strictly.  Show 
that  in  a  strictly  convex  game  all  marginal  vectors  are  different. 

18.4.  Sharing  Profits 

Consider the following situation with $n + 1$ players. Player 0 (the landlord) owns the
land and players $1, 2, \dots, n$ are $n$ identical workers who own their labor only. The
production function $f : \{0, 1, \dots, n\} \to \mathbb{R}$ describes how much is produced by the workers.
Assume that $f$ is nondecreasing and that $f(0) = 0$. We associate with this situation
a TU-game that reflects the production possibilities of coalitions. Without agent 0 a
coalition has zero worth, otherwise the worth depends on the number of workers.
More precisely,

$$v(S) = \begin{cases} 0 & \text{if } 0 \notin S \\ f(|S| - 1) & \text{if } 0 \in S \end{cases}$$

for every coalition $S \subseteq \{0, 1, \dots, n\}$.

(a)  Compute  the  marginal  vectors  and  the  Shapley  value  of  this  game. 

(b)  Compute  the  core  of  this  game. 

(c) Give a necessary and sufficient condition on $f$ such that the game is convex. [So
in  that  case,  the  core  and  the  Weber  set  coincide  and  the  Shapley  value  is  in  the 
core.] 

18.5.  Sharing  Costs 

Suppose that $n$ airlines share the cost of a runway. To serve the planes of company
$i$, the length of the runway must be $c_i$, which is also the cost of a runway of that
length. Assume $0 \le c_1 \le c_2 \le \dots \le c_n$. The cost of coalition $S$ is defined as
$c_S = \max_{i \in S} c_i$ for every nonempty coalition $S$.

(a)  Model  this  situation  as  a  cost  savings  game  (cf.  the  three  communities  game  in 
Chap.  1). 

(b)  Show  that  the  resulting  game  is  convex,  and  compute  the  marginal  vectors,  the 
Shapley  value,  and  the  core. 

18.6.  Independence  of  the  Axioms  in  Theorem  18.8 
Show  that  the  axioms  in  Theorem  18.8  are  independent. 

18.7.  Null-Player  in  Theorem  18.8 

Show  that  Theorem  18.8  still  holds  if  DUM  is  replaced  by  NP. 

18.8.  Characterization  of  Weighted  Shapley  Values 

Prove  Theorem  18.14.  Also  show  that  the  axioms  are  independent. 

18.9.  Core  and  Weighted  Shapley  Values  in  Example  18.2 

In Example 18.2, determine for each $x \in C(v)$ a weight system $w$ such that $x =
\Phi^w(v)$.


18.6 Notes

The  Weber  set  was  introduced  by  Weber  (1988),  who  also  gave  a  proof  of 
Theorem  18.3  (the  core  is  always  a  subset  of  the  Weber  set);  our  proof  is  due  to 
Derks  (1992).  Theorem  18.6  (coincidence  of  the  core  and  Weber  set  of  convex 
games) is from Shapley (1971) and Ichiishi (1981). The proof of Theorem 18.13
(positively weighted Shapley values are random order values) is from Owen (1972).
Theorem  18.14  (axiomatic  characterization  of  weighted  Shapley  values)  is  from 
Kalai  and  Samet  (1987),  see  also  Derks  et  al.  (2000).  Theorem  18.15  is  due  to 
Monderer  et  al.  (1992). 

Problem 18.5 is based on Littlechild and Owen (1974).

The core of a game with transferable utility can be a large set, but it can also be
empty.  The  Shapley  value  assigns  to  each  game  a  unique  point,  which,  however, 
does  not  have  to  be  in  the  core.  The  nucleolus  assigns  to  each  game  with  a  nonempty 
imputation  set  a  unique  element  of  that  imputation  set;  moreover,  this  element  is  in 
the  core  if  the  core  of  the  game  is  nonempty.  The  pre-nucleolus  exists  for  every 
essential  game  (and  does  not  have  to  be  an  imputation,  even  if  the  imputation  set  is 
nonempty),  but  for  balanced  games  it  coincides  with  the  nucleolus. 

In  this  chapter  we  consider  both  the  nucleolus  and  the  pre-nucleolus.  The  reader 
may  want  to  read  the  relevant  part  of  Chap.  9  first. 

In  Sect.  19.1  we  start  with  an  example  illustrating  the  (pre-)nucleolus  and 
Kohlberg’s  balancedness  criterion.  Section  19.2  introduces  the  lexicographic  order, 
on  which  the  definition  of  the  (pre-)nucleolus  in  Sect.  19.3  is  based.  Section  19.4 
presents  the  Kohlberg  criterion,  which  is  a  characterization  of  the  (pre-)nucleolus  in 
terms  of  balanced  collections  of  coalitions.  Computational  aspects  are  discussed  in 
Sect.  19.5,  while  Sect.  19.6  presents  Sobolev’s  characterization  of  the  pre-nucleolus 
based  on  a  reduced  game  property. 


19.1  An  Example 

Consider the three-person TU-game given by Table 19.1. It is easy to see that
this game has a nonempty core; for instance, (8, 8, 8) is a core element. Let
$x = (x_1, x_2, x_3)$ be an arbitrary efficient payoff distribution. For a nonempty
coalition $S$, define the excess of $S$ at $x$ as $e(S, x) := v(S) - x(S)$. For an efficient
vector $x$ the excess of the grand coalition $N$ is always zero, and is therefore
omitted from consideration. The idea underlying the nucleolus is as follows. For
an arbitrary efficient vector consider the corresponding vector of excesses. Among
all imputations [and for the pre-nucleolus: among all efficient vectors] find those
where the maximal excess is minimal. If this set consists of one point, then this is the
(pre-)nucleolus. Otherwise, determine the coalitions for which this maximal excess
cannot be further decreased, and repeat the procedure for the remaining coalitions,
keeping the excesses of the other coalitions fixed. By successively minimizing
maximal excesses, the (pre-)nucleolus results. Note that if a game has a nonempty
core then every core element has by definition only non-positive excesses, whereas
efficient payoff vectors outside the core have at least one positive excess. This
implies that for balanced games the successive minimization of excesses can be
restricted to core elements, and the pre-nucleolus and the nucleolus coincide.

Table 19.1 The example of Sect. 19.1

S                  ∅    {1}   {2}   {3}   {1,2}   {1,3}   {2,3}   {1,2,3}
v(S)               0     4     4     4      8      12      16       24
e(S, (8, 8, 8))         -4    -4    -4     -8      -4       0
e(S, (6, 9, 9))         -2    -5    -5     -7      -3      -2
e(S, (6, 8, 10))        -2    -4    -6     -6      -4      -2

In  order  to  illustrate  these  ideas,  consider  again  Table  19.1,  where  the  excesses 
of  some  core  vectors  for  this  example  are  calculated.  The  highest  excess  for  the 
core  vector  (8,  8,  8)  is  equal  to  zero,  attained  for  the  coalition  {2,3}.  Obviously,  this 
excess  can  be  decreased  by  increasing  the  payoff  for  players  2  and  3  together,  at 
the  expense  of  player  1,  who  has  an  excess  of  —4.  Thus,  a  next  ‘try’  is  the  payoff 
vector  (6,  9,  9),  which  indeed  has  maximal  excess  —2  reached  for  coalitions  {1}  and 
{2,  3}.  It  is  then  obvious  that  this  is  indeed  the  minimal  maximal  excess,  because  the 
excess  for  coalition  {1}  can  only  be  decreased  by  increasing  the  excess  for  {2,  3}, 
and  vice  versa.  In  particular,  this  implies  that  the  excesses  of  these  two  coalitions 
will  indeed  be  fixed  at  —2.  Observe  that  the  collection  {{1},  {2,  3}}  is  balanced  (in 
particular,  it  is  a  partition).  At  (6,  9,  9)  the  second  maximal  excess  is  equal  to  —3, 
reached  by  the  coalition  {1,3}.  Again,  this  might  be  decreased  by  improving  the 
payoff  for  players  1  and  3  together  at  the  expense  of  player  2.  Because  the  payoff 
for  player  1  has  already  been  fixed  at  6,  this  means  that  the  payoff  for  player  3  has  to 
be  increased  and  that  of  player  2  has  to  be  decreased.  These  observations  lead  to  the 
next  ‘try’  (6,8,  10),  where  the  maximal  excess  is  still  equal  to  —2,  and  the  second 
maximal  excess  equals  —4,  reached  by  the  coalitions  {2}  and  {1,3}.  It  is  obvious 
that  this  second  maximal  excess,  as  well  as  the  third  maximal  excess  of  —6,  cannot 
be  decreased  any  further.  Observe  that  also  the  collections  {{1},  {2,  3},  {2},  {1,3}} 
and  {{1},  {2,  3},  {2},  {1,3},  {3},  {1,2}}  are  all  balanced. 

It  follows  that  (6,  8, 10)  is  the  (pre-)nucleolus  of  this  game.  Moreover,  the 
excesses  are  closely  related  to  balanced  collections  of  coalitions;  this  will  appear 
to  be  a  more  general  phenomenon,  known  as  the  Kohlberg  criterion. 
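The excess rows of Table 19.1 can be recomputed with a few lines of code. The sketch below is an illustration only, with function names chosen for this example. It evaluates the excesses of all coalitions other than the grand coalition at the three payoff vectors discussed above, and sorts them in non-increasing order so that the vectors can be compared lexicographically.

```python
from itertools import combinations

players = (1, 2, 3)
# Worths from Table 19.1 (the empty coalition is not needed here).
v = {
    frozenset({1}): 4, frozenset({2}): 4, frozenset({3}): 4,
    frozenset({1, 2}): 8, frozenset({1, 3}): 12, frozenset({2, 3}): 16,
    frozenset({1, 2, 3}): 24,
}

def excess(S, x):
    """e(S, x) = v(S) - x(S), with x given as a tuple (x_1, x_2, x_3)."""
    return v[frozenset(S)] - sum(x[i - 1] for i in S)

def sorted_excesses(x):
    """Excesses of all nonempty coalitions except N, largest first."""
    proper = [c for r in (1, 2) for c in combinations(players, r)]
    return sorted((excess(S, x) for S in proper), reverse=True)

for x in [(8, 8, 8), (6, 9, 9), (6, 8, 10)]:
    print(x, sorted_excesses(x))
```

Python compares lists lexicographically, so `sorted_excesses((6, 8, 10)) < sorted_excesses((6, 9, 9)) < sorted_excesses((8, 8, 8))` expresses the successive improvements made in the text.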


1 See Chap. 16.

19.2 The Lexicographic Order

The  definition  of  the  nucleolus  is  based  on  a  comparison  of  vectors  by  means  of 
the  lexicographic  order.  We  briefly  discuss  this  order  and  examine  some  of  its 
properties. Let $\mathbb{R}^k$ be the real vector space of dimension $k$. A binary relation $\succeq$
on $\mathbb{R}^k$ that satisfies

(1) reflexivity: $x \succeq x$ for all $x \in \mathbb{R}^k$ and

(2) transitivity: $x \succeq z$ for all $x, y, z \in \mathbb{R}^k$ with $x \succeq y$ and $y \succeq z$

is called a partial order. For a partial order $\succeq$ we write $x \succ y$ to indicate that $x \succeq y$
and $y \not\succeq x$. A partial order $\succeq$ is called a weak order if it satisfies

(3) completeness: $x \succeq y$ or $y \succeq x$ for all $x, y \in \mathbb{R}^k$ with $x \ne y$.

If it also satisfies

(4) antisymmetry: $x = y$ for all $x, y \in \mathbb{R}^k$ with $x \succeq y$ and $y \succeq x$

the relation is called a linear order.

On the vector space $\mathbb{R}^k$ we define the linear order $\succeq_{\text{lex}}$ as follows. For any two
vectors $x$ and $y$ in $\mathbb{R}^k$, $x$ is lexicographically larger than or equal to $y$, notation:
$x \succeq_{\text{lex}} y$, if either $x = y$, or $x \ne y$ and for

$$i = \min\{j \in \{1, \dots, k\} \mid x_j \ne y_j\}$$

we have that $x_i > y_i$. In other words, $x$ should assign a higher value than $y$ to the
first coordinate at which $x$ and $y$ are different. For obvious reasons the order $\succeq_{\text{lex}}$ is
called the lexicographic order on $\mathbb{R}^k$.

The lexicographic order cannot be represented by a continuous utility function
(Problem 19.5). In fact, it can be shown that the lexicographic order on $\mathbb{R}^k$ cannot
be represented by any utility function (Problem 19.6).
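The definition translates almost literally into code. The following sketch, with a function name chosen only for this illustration, scans for the first coordinate at which the two vectors differ and compares the entries there.

```python
def lex_geq(x, y):
    """Return True if x is lexicographically larger than or equal to y."""
    if len(x) != len(y):
        raise ValueError("vectors must have the same dimension")
    for xj, yj in zip(x, y):
        if xj != yj:
            # first coordinate at which x and y differ decides
            return xj > yj
    return True  # x equals y

print(lex_geq((1, 0, 5), (1, 0, 3)))
print(lex_geq((0, 9), (1, 0)))
print(lex_geq((2, 2), (2, 2)))
```

Python's built-in comparison of tuples of equal length follows the same rule, so `tuple(x) >= tuple(y)` gives the same answers.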


19.3 The (Pre-)Nucleolus

Let $(N, v)$ be a TU-game and let $X \subseteq \mathbb{R}^N$ be some set of payoff distributions. For
every nonempty coalition $S \subseteq N$ and every $x \in X$ the excess of $S$ at $x$ is the number

$$e(S, x, v) := v(S) - x(S).$$


2 See also Chap. 11. Here, we repeat some of the definitions for convenience.

This number can be interpreted as the dissatisfaction (complaint, regret) of the
coalition $S$ if $x$ is the payoff vector. For every $x \in X$ let $\theta(x)$ denote the vector
of the excesses of all nonempty coalitions at $x$, arranged in non-increasing order,
hence

$$\theta(x) = (e(S_1, x, v), \dots, e(S_{2^n - 1}, x, v))$$

such that $e(S_t, x, v) \ge e(S_p, x, v)$ for all $1 \le t \le p \le 2^n - 1$. Let $\succeq_{\text{lex}}$ be the
lexicographic order on $\mathbb{R}^{2^n - 1}$, as defined in Sect. 19.2. The generalized nucleolus of
$(N, v)$ with respect to $X$ is the set

$$\mathcal{N}(N, v, X) := \{x \in X \mid \theta(y) \succeq_{\text{lex}} \theta(x) \text{ for all } y \in X\}.$$

So  the  generalized  nucleolus  consists  of  all  payoff  vectors  in  X  at  which  the  excess 
vectors  are  lexicographically  minimized.  The  motivation  for  this  is  that  we  first  try 
to  minimize  the  dissatisfaction  of  those  coalitions  for  which  this  is  maximal,  next 
for  those  coalitions  that  have  second  maximal  excess,  etc. 
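For a finite set $X$ the generalized nucleolus can be found by brute force, which may help to make the definition concrete. The sketch below is only an illustration under stated assumptions: it uses the game of Table 19.1 and takes as $X$ the integer imputations of that game, a hypothetical finite candidate set that is not convex. The function names are made up for this example.

```python
from itertools import combinations

def theta(v, n, x):
    """Excesses v(S) - x(S) of all nonempty coalitions of {1,...,n},
    arranged in non-increasing order; x is a tuple (x_1, ..., x_n)."""
    coalitions = [c for r in range(1, n + 1)
                  for c in combinations(range(1, n + 1), r)]
    return sorted((v[frozenset(S)] - sum(x[i - 1] for i in S)
                   for S in coalitions), reverse=True)

def generalized_nucleolus(v, n, X):
    """All x in the finite set X whose sorted excess vector is
    lexicographically minimal (Python compares lists lexicographically)."""
    best = min(theta(v, n, x) for x in X)
    return [x for x in X if theta(v, n, x) == best]

# Game of Table 19.1.
v = {
    frozenset({1}): 4, frozenset({2}): 4, frozenset({3}): 4,
    frozenset({1, 2}): 8, frozenset({1, 3}): 12, frozenset({2, 3}): 16,
    frozenset({1, 2, 3}): 24,
}
# Hypothetical finite X: integer vectors with x_i >= 4 and x(N) = 24.
X = [(a, b, 24 - a - b) for a in range(4, 17) for b in range(4, 21 - a)]
result = generalized_nucleolus(v, 3, X)
print(result)
```

The search returns the single vector (6, 8, 10), the point identified as the (pre-)nucleolus in Sect. 19.1. For a general finite $X$ the returned list may contain several vectors, since such a set need not be convex.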

We  first  establish  conditions  under  which  the  generalized  nucleolus  with  respect 
to  a  set  X  is  non-empty. 

Theorem 19.1 Let $X \subseteq \mathbb{R}^N$ be nonempty and compact. Then $\mathcal{N}(N, v, X)$ is
nonempty and compact for every game $v$.

Proof First observe that all excess functions $e(S, \cdot, v)$ are continuous and therefore
$\theta(\cdot)$ is continuous. Define $X_0 := X$ and, recursively,

$$X_t := \{x \in X_{t-1} \mid \theta_t(y) \ge \theta_t(x) \text{ for all } y \in X_{t-1}\}$$

for all $t = 1, \dots, 2^n - 1$. Since $\theta(\cdot)$ is continuous, Weierstrass' Theorem implies
that every $X_t$ is a non-empty compact subset of $X_{t-1}$. This holds in particular for
$t = 2^n - 1$ and, clearly, $X_{2^n - 1} = \mathcal{N}(N, v, X)$. ■

We  will  show  that,  if  X  is,  moreover,  convex,  then  the  generalized  nucleolus  with 
respect  to  X  consists  of  a  single  point.  We  start  with  the  following  lemma. 

Lemma 19.2 Let $X \subseteq \mathbb{R}^N$ be convex, $x, y \in X$, and $0 \le \alpha \le 1$. Then

$$\alpha\theta(x) + (1 - \alpha)\theta(y) \succeq_{\text{lex}} \theta(\alpha x + (1 - \alpha)y). \tag{19.1}$$

Proof Let $S_1, \dots, S_{2^n - 1}$ be an ordering of the non-empty coalitions such that

$$\theta(\alpha x + (1 - \alpha)y) = (e(S_1, \alpha x + (1 - \alpha)y, v), \dots, e(S_{2^n - 1}, \alpha x + (1 - \alpha)y, v)).$$

The right hand side of this equation is equal to $\alpha a + (1 - \alpha)b$, where
$a = (e(S_1, x, v), \dots, e(S_{2^n - 1}, x, v))$ and $b = (e(S_1, y, v), \dots, e(S_{2^n - 1}, y, v))$. Since
$\theta(x) \succeq_{\text{lex}} a$ and $\theta(y) \succeq_{\text{lex}} b$ it follows that

$$\alpha\theta(x) + (1 - \alpha)\theta(y) \succeq_{\text{lex}} \alpha a + (1 - \alpha)b = \theta(\alpha x + (1 - \alpha)y),$$

hence (19.1) holds. ■

Theorem 19.3 Let $X \subseteq \mathbb{R}^N$ be nonempty, compact, and convex. Then, for every
game $(N, v)$, the generalized nucleolus with respect to $X$ consists of a single point.

Proof By Theorem 19.1 the generalized nucleolus is nonempty. Let $x, y \in
\mathcal{N}(N, v, X)$ and $0 < \alpha < 1$. Then $\theta(x) = \theta(y)$. Since on the one hand

$$\theta(\alpha x + (1 - \alpha)y) \succeq_{\text{lex}} \theta(x) = \theta(y)$$

and on the other hand, by Lemma 19.2,

$$\theta(x) = \theta(y) = \alpha\theta(x) + (1 - \alpha)\theta(y) \succeq_{\text{lex}} \theta(\alpha x + (1 - \alpha)y),$$

antisymmetry of the lexicographic order implies

$$\theta(\alpha x + (1 - \alpha)y) = \theta(x) = \theta(y).$$

Therefore $\theta(x) = \theta(y) = \alpha a + (1 - \alpha)b$, with $a$ and $b$ as in the proof of Lemma 19.2.
Since $\theta(x) \succeq_{\text{lex}} a$ and $\theta(y) \succeq_{\text{lex}} b$ it follows that $a = \theta(x)$ and $b = \theta(y)$. In $a$ and $b$
the coalitions are ordered in the same way and therefore this is also the case in $\theta(x)$
and $\theta(y)$. Hence $x$ and $y$ have all excesses equal, and thus $x = y$. ■

Well-known choices for the set $X$ are the imputation set $I(N, v)$ of a game $(N, v)$
and the set of efficient payoff distributions $I^*(N, v) = \{x \in \mathbb{R}^N \mid x(N) = v(N)\}$.
For an essential game $(N, v)$ the set $I(N, v)$ is nonempty, compact and convex,
and therefore Theorem 19.3 implies that the generalized nucleolus with respect
to $I(N, v)$ consists of a single point, called the nucleolus of $(N, v)$, and denoted
by $\nu(N, v)$. Although the set $I^*(N, v)$ is not compact, the generalized nucleolus
of $(N, v)$ with respect to this set exists and is also single-valued (Problem 19.7):
its unique member is called the pre-nucleolus of $(N, v)$, denoted by $\nu^*(N, v)$. In
Problem 19.8 the reader is asked to show that both points are in the core of the game
if this set is nonempty, and then coincide.


19.4 The Kohlberg Criterion

In this section we derive the so-called Kohlberg criterion for the pre-nucleolus,
which characterizes this solution in terms of balanced sets (cf. Chap. 16). A similar
result can be derived for the nucleolus (Problem 19.9) but the formulation for the
pre-nucleolus is slightly simpler.

We start, however, with Kohlberg's characterization in terms of side-payments.
A side-payment is a vector $y \in \mathbb{R}^N$ satisfying $y(N) = 0$. Let $(N, v)$ be a game and
for every $\alpha \in \mathbb{R}$ and $x \in \mathbb{R}^N$ denote by

$$\mathcal{D}(\alpha, x, v) = \{S \in 2^N \setminus \{\emptyset\} \mid e(S, x, v) \geq \alpha\}$$

the set of (nonempty) coalitions with excess at least $\alpha$ at $x$.

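Theorem 19.4 Let $(N, v)$ be a game and $x \in I^*(N, v)$. Then the following two
statements are equivalent.

(a) $x = \nu^*(N, v)$.

(b) For every $\alpha$ such that $\mathcal{D}(\alpha, x, v) \neq \emptyset$ and for every side-payment $y$ with $y(S) \geq 0$
for every $S \in \mathcal{D}(\alpha, x, v)$ we have $y(S) = 0$ for every $S \in \mathcal{D}(\alpha, x, v)$.

Proof Assume that $x = \nu^*(N, v)$ and that the conditions in (b) are fulfilled for $x$,
$\alpha$, and $y$. Define $z_\varepsilon = x + \varepsilon y$ for every $\varepsilon > 0$. Then $z_\varepsilon \in I^*(N, v)$. Choose $\varepsilon^* > 0$
such that, for all $S \in \mathcal{D}(\alpha, x, v)$ and nonempty $T \notin \mathcal{D}(\alpha, x, v)$,

$$e(S, z_{\varepsilon^*}, v) > e(T, z_{\varepsilon^*}, v). \qquad (19.2)$$

For every $S \in \mathcal{D}(\alpha, x, v)$,

$$e(S, z_{\varepsilon^*}, v) = v(S) - (x(S) + \varepsilon^* y(S)) = e(S, x, v) - \varepsilon^* y(S) \leq e(S, x, v). \qquad (19.3)$$

Assume, contrary to what we want to prove, that $y(S) > 0$ for some $S \in \mathcal{D}(\alpha, x, v)$.
Then, by (19.2) and (19.3), $\theta(x) \succeq_{lex} \theta(z_{\varepsilon^*})$ and $x \neq z_{\varepsilon^*}$, a contradiction.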

Next, let $x \in I^*(N, v)$ satisfy (b). Let $z = \nu^*(N, v)$. Denote

$$\{e(S, x, v) \mid S \in 2^N \setminus \{\emptyset\}\} = \{\alpha_1, \ldots, \alpha_p\}$$

with $\alpha_1 > \cdots > \alpha_p$. Define $y = z - x$. Hence, $y$ is a side-payment. Since $\theta(x) \succeq_{lex} \theta(z)$,
we have $e(S, x, v) = \alpha_1 \geq e(S, z, v)$ for all $S \in \mathcal{D}(\alpha_1, x, v)$ and thus

$$e(S, x, v) - e(S, z, v) = (z - x)(S) = y(S) \geq 0.$$

Therefore, by (b), $y(S) = 0$ for all $S \in \mathcal{D}(\alpha_1, x, v)$.

Assume now that $y(S) = 0$ for all $S \in \mathcal{D}(\alpha_t, x, v)$ for some $1 \leq t < p$. Then, since
$\theta(x) \succeq_{lex} \theta(z)$,

$$e(S, x, v) = \alpha_{t+1} \geq e(S, z, v) \text{ for all } S \in \mathcal{D}(\alpha_{t+1}, x, v) \setminus \mathcal{D}(\alpha_t, x, v).$$

Hence $y(S) \geq 0$ and thus, by (b), $y(S) = 0$ for all $S \in \mathcal{D}(\alpha_{t+1}, x, v)$. We conclude
that $y(S) = 0$ for all $S \in 2^N \setminus \{\emptyset\}$, so $y = 0$ and $x = z$. ■

We can now prove Kohlberg's characterization of the pre-nucleolus by balanced
collections.

Theorem 19.5 (Kohlberg) Let $(N, v)$ be a game and $x \in I^*(N, v)$. Then the
following two statements are equivalent.

(a) $x = \nu^*(N, v)$.

(b) For every $\alpha$, $\mathcal{D}(\alpha, x, v) \neq \emptyset$ implies that $\mathcal{D}(\alpha, x, v)$ is a balanced collection.

Proof Assume that $x$ satisfies (b). Let $\alpha \in \mathbb{R}$ such that $\mathcal{D}(\alpha, x, v) \neq \emptyset$, and let $y$
be a side-payment with $y(S) \geq 0$ for all $S \in \mathcal{D}(\alpha, x, v)$. Since, by (b), $\mathcal{D}(\alpha, x, v)$ is
balanced there are numbers $\lambda(S) > 0$, $S \in \mathcal{D}(\alpha, x, v)$, such that

$$\sum_{S \in \mathcal{D}(\alpha, x, v)} \lambda(S) e^S = e^N.$$

By taking the product on both sides with $y$ this implies

$$\sum_{S \in \mathcal{D}(\alpha, x, v)} \lambda(S) y(S) = y(N) = 0.$$

Therefore $y(S) = 0$ for every $S \in \mathcal{D}(\alpha, x, v)$. Thus, Theorem 19.4 implies $x =
\nu^*(N, v)$.

Assume next that $x = \nu^*(N, v)$. Let $\alpha \in \mathbb{R}$ such that $\mathcal{D}(\alpha, x, v) \neq \emptyset$. Consider
the linear program

$$\max \sum_{S \in \mathcal{D}(\alpha, x, v)} y(S) \quad \text{subject to } -y(S) \leq 0,\ S \in \mathcal{D}(\alpha, x, v), \text{ and } y(N) = 0. \qquad (19.4)$$

This program is feasible and, by Theorem 19.4, its value is 0. Hence (Problem 19.10)
its dual is feasible, that is, there are $\lambda(S) \geq 0$, $S \in \mathcal{D}(\alpha, x, v)$, and $\lambda(N) \in \mathbb{R}$ such
that

$$-\sum_{S \in \mathcal{D}(\alpha, x, v)} \lambda(S) e^S + \lambda(N) e^N = \sum_{S \in \mathcal{D}(\alpha, x, v)} e^S.$$

Hence $\lambda(N) e^N = \sum_{S \in \mathcal{D}(\alpha, x, v)} (1 + \lambda(S)) e^S$. Since $1 + \lambda(S) > 0$ for every
$S \in \mathcal{D}(\alpha, x, v)$, we have $\lambda(N) > 0$ and thus $\mathcal{D}(\alpha, x, v)$ is balanced, with balancing
weights $(1 + \lambda(S))/\lambda(N)$ for $S \in \mathcal{D}(\alpha, x, v)$. ■


19.5 Computation of the Nucleolus

For two-person games, the (pre-)nucleolus is easy to compute (Problem 19.13).
In general, the computation of the nucleolus can be based on the subsequent
determination of the sets $X_0, X_1, X_2, \ldots$ in Theorem 19.1, but this may not be easy,
as the following example shows.

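Example 19.6 Consider the TU-game $v$ with player set $N = \{1, 2, 3, 4\}$ defined by$^3$

$$v(S) = \begin{cases}
20 & \text{if } S = N \\
8 & \text{if } S = \{1, 2\} \\
8 & \text{if } S = \{3, 4\} \\
4 & \text{if } S = \{1\} \\
2 & \text{if } S = \{3\} \\
0 & \text{otherwise.}
\end{cases}$$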


First observe that it is easy to find some imputation [e.g., $x = (6, 4, 5, 5)$] such
that the excesses of $\{1, 2\}$ and $\{3, 4\}$ are both equal to $-2$ and all other excesses
are at most $-2$. Clearly, this must be the minimal (over all imputations) maximal
excess attainable, since decreasing the excess of $\{1, 2\}$ implies increasing the excess
of $\{3, 4\}$ by efficiency, and vice versa. Thus,

$$X_1 = \{x \in I(v) \mid \theta(x)_1 = -2\},$$

and $X_2 = X_1$ since the excess of $-2$ is reached at the two coalitions $\{1, 2\}$ and
$\{3, 4\}$. Consistently with the Kohlberg criterion, these coalitions form a balanced
collection. Next, observe that the remaining excesses are always at most as large as
at least one of the excesses of the four one-person coalitions. So we can find the
second-highest excess by minimizing $\alpha$ subject to the constraints


$$\begin{aligned}
8 - x_1 - x_2 &= -2 \\
8 - x_3 - x_4 &= -2 \\
4 - x_1 &\leq \alpha \\
-x_2 &\leq \alpha \\
2 - x_3 &\leq \alpha \\
-x_4 &\leq \alpha.
\end{aligned}$$

3 Observe that this game has a non-empty core and therefore the nucleolus and pre-nucleolus
coincide and are elements of the core. Cf. Problem 19.8.

This can be rewritten as the system

$$\begin{aligned}
x_1 + x_2 &= 10 \\
x_3 + x_4 &= 10 \\
x_1 &\geq 4 - \alpha \\
x_2 &\geq -\alpha \\
x_3 &\geq 2 - \alpha \\
x_4 &\geq -\alpha.
\end{aligned}$$

By considering the equation and inequalities involving $x_1$ and $x_2$, we obtain $\alpha \geq -3$,
and those with $x_3$ and $x_4$ yield $\alpha \geq -4$. So the obvious minimum value is $\alpha = -3$.
Hence, the next two coalitions of which the excesses become fixed are $\{1\}$ and $\{2\}$,
and, thus, the nucleolus allocates $x_1 = 7$ to player 1 and $x_2 = 3$ to player 2. The
third step in the computation is to minimize $\alpha$ subject to the constraints


$$\begin{aligned}
x_3 + x_4 &= 10 \\
x_3 &\geq 2 - \alpha \\
x_4 &\geq -\alpha.
\end{aligned}$$

(Note that the constraints that only refer to $x_1$ and $x_2$ have become superfluous.) This
linear program has the obvious solution $\alpha = -4$, which yields $x_3 = 6$ and $x_4 = 4$.
Thus, the (pre-)nucleolus of this game is

$$\nu(v) = \nu^*(v) = (7, 3, 6, 4).$$

It  is  interesting  to  see  that,  even  though  at  first  glance  player  4  does  not  seem  to 
have  any  noticeable  advantage  over  player  2,  he  is  still  doing  better  in  the  nucleolus. 
This  is  due  to  the  fact  that  early  on  in  the  process  player  4  was  grouped  together 
with  player  3,  who  has  a  lower  individual  worth  than  player  1,  with  whom  player  2 
becomes  partnered.  Thus,  player  4  obtains  a  larger  slice  of  the  cake  of  size  10  that 
he  has  to  share  with  player  3,  than  player  2  does  in  a  similar  situation.  □ 
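
The outcome of this example can be checked numerically. The following Python sketch is an illustration that is not part of the original text. It assumes the worths $v(N) = 20$, $v(12) = v(34) = 8$, $v(1) = 4$, $v(3) = 2$ (and 0 otherwise), which are the values implied by the constraints above. It leaves out the grand coalition, whose excess is 0 at every efficient payoff vector, and the helper names `coalitions` and `theta` are hypothetical.

```python
from itertools import combinations

players = (1, 2, 3, 4)
# Assumed worths, read off from the constraints of Example 19.6;
# every coalition that is not listed has worth 0.
worth = {frozenset({1, 2}): 8, frozenset({3, 4}): 8,
         frozenset({1}): 4, frozenset({3}): 2}


def coalitions():
    """All nonempty proper coalitions (the grand coalition is left out)."""
    return [frozenset(c) for r in range(1, len(players))
            for c in combinations(players, r)]


def theta(x):
    """Excesses e(S, x, v) = v(S) - x(S), sorted in non-increasing order."""
    payoff = dict(zip(players, x))
    return sorted((worth.get(S, 0) - sum(payoff[i] for i in S)
                   for S in coalitions()), reverse=True)


theta_start = theta((6, 4, 5, 5))
theta_nucleolus = theta((7, 3, 6, 4))
print(theta_start[:6])                 # [-2, -2, -2, -3, -4, -5]
print(theta_nucleolus[:6])             # [-2, -2, -3, -3, -4, -4]
# Python compares lists lexicographically, exactly as needed here.
print(theta_nucleolus < theta_start)   # True
```

Both vectors attain the maximal excess $-2$ at $\{1, 2\}$ and $\{3, 4\}$, but $(7, 3, 6, 4)$ has a lower third-highest excess and is therefore lexicographically better than $(6, 4, 5, 5)$.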

This example raises the question how the nucleolus of a given TU-game can be
computed in a systematic way. More generally, let $(N, v)$ be a game. In order
to compute the nucleolus $\mathcal{N}(N, v, X)$ for a compact polyhedral set $X \subseteq \mathbb{R}^N$,
determined by a system of linear (in)equalities, we can start by solving the linear
program$^4$

$$\text{Minimize } \alpha \text{ subject to } x(S) + \alpha \geq v(S),\ \forall\, \emptyset \neq S \in 2^N,\ x \in X.$$

4 Under appropriate restrictions this program is feasible and bounded.

Let $\alpha_1$ denote the minimum of this program and let $X_1 \subseteq X$ be the set of points
where the minimum is obtained. If $|X_1| = 1$ then $\mathcal{N}(N, v, X) = X_1$. Otherwise, let
$B_1$ be the set of coalitions $S$ such that $e(S, x, v) = \alpha_1$ for all $x \in X_1$, and solve the
linear program

$$\text{Minimize } \alpha \text{ subject to } x(S) + \alpha \geq v(S),\ \forall\, \emptyset \neq S \in 2^N \setminus B_1,\ x \in X_1.$$

Continuing in this way, we eventually reach a unique point, which is the generalized
nucleolus of $v$ with respect to $X$ (cf. Theorem 19.3). The following example
illustrates this.

Example 19.7 Let $N = \{1, 2, 3, 4\}$ and $v(N) = 100$, $v(123) = 95$, $v(124) = 85$,
$v(134) = 80$, $v(234) = 55$, $v(ij) = 50$ for all $i \neq j$, $v(i) = 0$ for all $i$.$^5$ We compute
the pre-nucleolus (which will turn out to be an imputation and therefore equal to the
nucleolus). We start with the linear program

Minimize $\alpha$ subject to

$$\begin{aligned}
x_1 + x_2 + x_3 + \alpha &\geq 95 \\
x_1 + x_2 + x_4 + \alpha &\geq 85 \\
x_1 + x_3 + x_4 + \alpha &\geq 80 \\
x_2 + x_3 + x_4 + \alpha &\geq 55 \\
x_i + x_j + \alpha &\geq 50 \\
x_i + \alpha &\geq 0 \\
x_1 + x_2 + x_3 + x_4 &= 100
\end{aligned}$$

for all $i, j \in N$ with $i \neq j$.

Solving this program results in $\alpha_1 = 10$, obtained over the set $X_1$ given by

$$x_1 + x_2 = 60, \quad x_1 \geq 30, \quad x_2 \geq 25, \quad x_3 = 25, \quad x_4 = 15,$$

and

$$B_1 = \{123, 124, 34\},$$

which is a balanced set, as was to be expected (cf. Theorem 19.5).

5 We write $v(123)$ instead of $v(\{1, 2, 3\})$, etc.

The second linear program is now
Minimize $\alpha$ subject to

$$\begin{aligned}
x_1 + x_3 + x_4 + \alpha &\geq 80 \\
x_2 + x_3 + x_4 + \alpha &\geq 55 \\
x_1 + x_3 + \alpha &\geq 50 \\
x_1 + x_4 + \alpha &\geq 50 \\
x_2 + x_3 + \alpha &\geq 50 \\
x_2 + x_4 + \alpha &\geq 50 \\
x_i + \alpha &\geq 0 \\
x &\in X_1
\end{aligned}$$

for all $i \in N$.

By some simplifications this program reduces to
Minimize $\alpha$ subject to

$$\begin{aligned}
x_1 + \alpha &\geq 40 \\
x_2 + \alpha &\geq 35 \\
x_1 + x_2 &= 60 \\
x_1 &\geq 30 \\
x_2 &\geq 25,
\end{aligned}$$

with solution $\alpha_2 = 7.5$, $x_1 = 32.5$, $x_2 = 27.5$. Hence, the (pre-)nucleolus of this
game is

$$\nu(N, v) = \nu^*(N, v) = (32.5, 27.5, 25, 15),$$

and $B_1 = \{123, 124, 34\}$, $B_2 = \{134, 24\}$: the two binding constraints $x_1 + \alpha \geq 40$ and
$x_2 + \alpha \geq 35$ of the reduced program stem from the coalitions 134 and 24, respectively.
By Theorem 19.5 it can be verified that we have indeed found the pre-nucleolus
(Problem 19.15). □
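
As a numerical cross-check of this example, the following Python sketch (an illustration that is not part of the original text) computes all excesses at $x = (32.5, 27.5, 25, 15)$ and groups the proper coalitions by excess level. It also verifies that given weights balance a collection; the weights used below are chosen by hand and are an assumption of this sketch, as is the helper name `is_balanced_with`.

```python
from fractions import Fraction
from itertools import combinations

players = (1, 2, 3, 4)
THREE = {frozenset({1, 2, 3}): 95, frozenset({1, 2, 4}): 85,
         frozenset({1, 3, 4}): 80, frozenset({2, 3, 4}): 55}


def worth(S):
    """Characteristic function of Example 19.7."""
    if len(S) == 4:
        return 100
    if len(S) == 3:
        return THREE[S]
    return 50 if len(S) == 2 else 0


x = dict(zip(players, (Fraction(65, 2), Fraction(55, 2), 25, 15)))
proper = [frozenset(c) for r in range(1, 4) for c in combinations(players, r)]
excess = {S: worth(S) - sum(x[i] for i in S) for S in proper}

# Excess levels from high to low, and the coalitions at the two highest levels.
levels = sorted(set(excess.values()), reverse=True)
top_two = [sorted(sorted(S) for S in proper if excess[S] == a)
           for a in levels[:2]]


def is_balanced_with(weights):
    """Check that positive weights satisfy sum_S weights[S] * e^S = e^N."""
    return all(w > 0 for w in weights.values()) and all(
        sum(w for S, w in weights.items() if i in S) == 1 for i in players)


half, quarter = Fraction(1, 2), Fraction(1, 4)
# Hand-picked weights (assumption): the coalitions with excess at least 10 ...
b1 = {frozenset({1, 2, 3}): half, frozenset({1, 2, 4}): half,
      frozenset({3, 4}): half}
# ... and the coalitions with excess at least 7.5.
b12 = {frozenset({1, 2, 3}): half, frozenset({1, 2, 4}): quarter,
       frozenset({3, 4}): quarter, frozenset({1, 3, 4}): quarter,
       frozenset({2, 4}): quarter}

print([float(a) for a in levels[:2]])   # [10.0, 7.5]
print(top_two)   # [[[1, 2, 3], [1, 2, 4], [3, 4]], [[1, 3, 4], [2, 4]]]
print(is_balanced_with(b1), is_balanced_with(b12))   # True True
```

The check covers only the two highest excess levels; a full application of Theorem 19.5 repeats the balancedness test for every level.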


19.6 A Characterization of the Pre-nucleolus

The pre-nucleolus is a single-valued solution concept, defined for any game $(N, v)$.
Hence, it is an example of a value on $\mathcal{G}$ (see Chap. 17). In this section we provide a
characterization based on a reduced game property. We will not give the complete
proof of this characterization but, instead, refer the reader to the literature.

Let $\psi$ be a value on $\mathcal{G}$, and let $x \in \mathbb{R}^N$. Let $S$ be a non-empty coalition. The
Davis-Maschler reduced game for $S$ at $x$ is the game $(S, v_{S,x}) \in \mathcal{G}^S$ defined by

$$v_{S,x}(T) := \begin{cases}
0 & \text{if } T = \emptyset \\
v(N) - x(N \setminus S) & \text{if } T = S \\
\max_{Q \subseteq N \setminus S} \left[ v(T \cup Q) - x(Q) \right] & \text{otherwise.}
\end{cases}$$

The interpretation of this reduced game is as follows. Suppose $x$ is the payoff
vector for the grand coalition. The coalition $S$ could renegotiate these payoffs
among themselves. Assume that the outside players are happy with $x$. Hence, $S$ has
$v(N) - x(N \setminus S)$ to redistribute. Any smaller coalition $T$, however, could cooperate
with zero or more outside players and pay them according to $x$: then $v_{S,x}(T)$ as
defined above is the maximum they could get. Hence, the redistribution game takes
the form $v_{S,x}$.
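
To make the definition concrete, the following Python sketch (an illustration, not taken from the original text) computes the Davis-Maschler reduced game by enumerating all $Q \subseteq N \setminus S$. The function names `subsets` and `reduced_game` are hypothetical. As data it uses the game of Example 19.7 together with its pre-nucleolus $x = (32.5, 27.5, 25, 15)$ and the coalition $S = \{1, 2\}$.

```python
from itertools import combinations

THREE = {frozenset({1, 2, 3}): 95, frozenset({1, 2, 4}): 85,
         frozenset({1, 3, 4}): 80, frozenset({2, 3, 4}): 55}


def worth(S):
    """Characteristic function of Example 19.7 (nonempty coalitions)."""
    if len(S) == 4:
        return 100
    if len(S) == 3:
        return THREE[S]
    return 50 if len(S) == 2 else 0


def subsets(players):
    """All subsets of the given players, including the empty set."""
    players = tuple(players)
    return [frozenset(c) for r in range(len(players) + 1)
            for c in combinations(players, r)]


def reduced_game(N, v, S, x):
    """Davis-Maschler reduced game v_{S,x}, as a dict from T (subset of S) to worth."""
    N, S = frozenset(N), frozenset(S)
    outside = N - S
    game = {}
    for T in subsets(S):
        if not T:
            game[T] = 0
        elif T == S:
            game[T] = v(N) - sum(x[i] for i in outside)
        else:
            # T may team up with any set Q of outsiders and pay them x(Q).
            game[T] = max(v(T | Q) - sum(x[i] for i in Q)
                          for Q in subsets(outside))
    return game


x = {1: 32.5, 2: 27.5, 3: 25, 4: 15}
g = reduced_game({1, 2, 3, 4}, worth, {1, 2}, x)
one, two, both = frozenset({1}), frozenset({2}), frozenset({1, 2})
print(g[one], g[two], g[both])   # 40 35 60

# Equal split of the surplus in the two-person reduced game.
surplus = g[both] - g[one] - g[two]
split = {1: g[one] + surplus / 2, 2: g[two] + surplus / 2}
print(split)   # {1: 32.5, 2: 27.5}
```

Splitting the surplus $60 - 40 - 35 = -15$ equally, which is the well-known pre-nucleolus of a two-person game (cf. Problem 19.13), returns $(32.5, 27.5)$. In this instance the reduced game thus reproduces the original payoffs of players 1 and 2, in line with the consistency property discussed in this section.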

The following axiom for a value $\psi$ on $\mathcal{G}$ requires the outcome of the redistribution
game for $S$ to be equal to the original outcome.

Davis-Maschler consistency (DMC): $\psi_i(S, v_{S,x}) = \psi_i(N, v)$ for every $(N, v) \in \mathcal{G}$,
$\emptyset \neq S \subseteq N$, $x = \psi(N, v)$ and $i \in S$.

The announced characterization is based on two other axioms, namely Anonymity
(AN, see Sect. 17.1) and the following axiom.

Covariance (COV): $\psi(N, \alpha v + b) = \alpha \psi(N, v) + b$ for all $(N, v) \in \mathcal{G}$, every
$\alpha \in \mathbb{R}$, $\alpha > 0$, and every $b \in \mathbb{R}^N$, where $(\alpha v + b)(S) := \alpha v(S) + b(S)$ for every
non-empty coalition $S \subseteq N$.

Remark 19.8 Covariance requires that the value $\psi$ respects strategic equivalence of
games, cf. Problem 16.12.


We first prove that the pre-nucleolus is Davis-Maschler consistent.

Lemma 19.9 The pre-nucleolus, as a value on $\mathcal{G}$, is Davis-Maschler consistent.

Proof Let $(N, v) \in \mathcal{G}$, $x = \nu^*(N, v)$ and $\emptyset \neq S \subseteq N$. Let $x_S \in \mathbb{R}^S$ be the
restriction of $x$ to $S$, then we have to prove that $x_S = \nu^*(S, v_{S,x})$. Let $\alpha \in \mathbb{R}$
with $\mathcal{D}(\alpha, x_S, v_{S,x}) \neq \emptyset$ and let $y_S \in \mathbb{R}^S$ be a side-payment with $y_S(R) \geq 0$ for
every $R \in \mathcal{D}(\alpha, x_S, v_{S,x})$ then, in view of Theorem 19.4 it is sufficient to prove that
$y_S(R) = 0$ for every $R \in \mathcal{D}(\alpha, x_S, v_{S,x})$. We claim that

$$\{T \cap S \mid T \in \mathcal{D}(\alpha, x, v),\ \emptyset \neq T \cap S \neq S\} = \mathcal{D}(\alpha, x_S, v_{S,x}) \setminus \{S\}. \qquad (19.5)$$

To see this, first assume $T \in \mathcal{D}(\alpha, x, v)$, $\emptyset \neq T \cap S \neq S$. Then

$$\begin{aligned}
v_{S,x}(T \cap S) - x_S(T \cap S) &\geq v((T \cap S) \cup (T \setminus S)) - x(T \setminus S) - x(T \cap S) \\
&= v(T) - x(T) \\
&\geq \alpha
\end{aligned}$$

which proves one direction. For the other direction, let $S' \in \mathcal{D}(\alpha, x_S, v_{S,x})$, $S' \neq S$.
Then

$$\max_{Q \subseteq N \setminus S} \left[ v(S' \cup Q) - x(Q) \right] - x(S') \geq \alpha$$

hence

$$v(S' \cup Q') - x(Q' \cup S') \geq \alpha$$

for some $Q' \subseteq N \setminus S$; now take $T = S' \cup Q'$ to establish the converse direction.

Extend $y_S$ to a vector $y \in \mathbb{R}^N$ by setting $y_i = 0$ for all $i \in N \setminus S$. Then $y(N) =
0$ and, by (19.5), $y(R) \geq 0$ for all $R \in \mathcal{D}(\alpha, x, v)$. By Theorem 19.4, it follows
that $y(R) = 0$ for all $R \in \mathcal{D}(\alpha, x, v)$. Hence, by (19.5), $y_S(R) = 0$ for all $R \in
\mathcal{D}(\alpha, x_S, v_{S,x})$, which completes the proof. ■

As an additional result, we prove that COV and DMC imply Efficiency (EFF).

Lemma 19.10 Let $\psi$ be a value on $\mathcal{G}$ satisfying COV and DMC. Then $\psi$ satisfies
EFF.

Proof Let $(\{i\}, v)$ be a one-person game. If $v(i) = 0$ then, by COV, $\psi(\{i\}, 0) =
\psi(\{i\}, 2 \cdot 0) = 2\psi(\{i\}, 0)$, hence $\psi(\{i\}, 0) = 0$. Again by COV,

$$\psi(\{i\}, v) = \psi(\{i\}, 0 + v) = \psi(\{i\}, 0) + v(i) = v(i),$$

so EFF on one-person games is satisfied. Now let $(N, v) \in \mathcal{G}$ with at least two
players. Let $x = \psi(N, v)$ and $i \in N$. By DMC, $x_i = \psi(\{i\}, v_{\{i\},x})$. Hence, $x_i =
v_{\{i\},x}(i) = v(N) - x(N \setminus \{i\})$, where the second equality follows by definition of the
reduced game. Thus, $x(N) = v(N)$ and the proof is complete. ■

The announced characterization of the pre-nucleolus is as follows.

Theorem 19.11 A value $\psi$ on $\mathcal{G}$ satisfies COV, AN, and DMC if and only if it is the
pre-nucleolus.

Proof COV and AN of the pre-nucleolus follow from Problem 19.17. DMC follows
from Lemma 19.9. For the only-if part see the Notes to this chapter. ■


19.7  Problems 

19.1. Binary Relations

Give an example of a relation that satisfies (1)-(3), but not (4) in Sect. 19.2. Also
find an example that only violates (3), and one that only violates (2). What about
(1)? Give an example of a partial order that is neither antisymmetric nor complete.

19.2. Linear Orders

Let $\succeq$ be a linear order. Show that $x \succ y$ holds if and only if $x \succeq y$ and $x \neq y$.

19.3. The Lexicographic Order (1)

Show that $\succeq_{lex}$ is indeed a linear order.

19.4. The Lexicographic Order (2)

Find the set of points $(x_1, x_2)$ in $\mathbb{R}^2$ for which $(x_1, x_2) \succeq_{lex} (3, 1)$. Draw this set in
the Cartesian plane. Is this set closed?

19.5. Representability of Lexicographic Order (1)

Let $u: \mathbb{R}^n \to \mathbb{R}$ be a continuous function. Define $\succeq_u$ by $x \succeq_u y$ if and only if
$u(x) \geq u(y)$. Use Problem 19.4 to show that $\succeq_u \neq \succeq_{lex}$.

19.6. Representability of Lexicographic Order (2)

Show that the lexicographic order cannot be represented by any utility function.
[Hint: take the lexicographic order on $\mathbb{R}^2$ and argue that representability implies
that for each pair of real numbers $t$ and $s$ we can find rational numbers $q(t)$ and $q(s)$
such that $q(t) \neq q(s)$ whenever $t \neq s$. Hence, we have uncountably many different
rational numbers, a contradiction.]

19.7. Single-Valuedness of the Pre-nucleolus

Prove that the nucleolus of any game $(N, v)$ with respect to $I^*(N, v)$ is single-valued.

19.8. (Pre-)Nucleolus and Core

Let $(N, v)$ be a game with $C(N, v) \neq \emptyset$. Prove that $\nu(N, v) = \nu^*(N, v) \in C(N, v)$.

19.9. Kohlberg Criterion for the Nucleolus

Let $(N, v)$ be a game satisfying $I(N, v) \neq \emptyset$, and let $x \in I(N, v)$. Prove that $x =
\nu(N, v)$ if and only if for every $\alpha \in \mathbb{R}$: if $\mathcal{D}(\alpha, x, v) \neq \emptyset$ then there exists a set
$\mathcal{E}(\alpha, x, v) \subseteq \{\{j\} \mid j \in N,\ x_j = v(j)\}$ such that $\mathcal{D}(\alpha, x, v) \cup \mathcal{E}(\alpha, x, v)$ is balanced.

19.10. Proof of Theorem 19.5

In the proof of Theorem 19.5, determine the dual program and conclude that it is
feasible. Hint: use Theorem 16.20 and Remark 16.21.

19.11. Nucleolus of a Three-Person Game (1)

Compute the nucleolus of the three-person game $v$ defined by

S      {1}   {2}   {3}   {1,2}   {1,3}   {2,3}   {1,2,3}
v(S)    4     3     2      4       3       2       12

19.12. Nucleolus of a Three-Person Game (2)

(a) Compute the nucleolus of the three-person TU-game defined by

S      {1}   {2}   {3}   {1,2}   {1,3}   {2,3}   {1,2,3}
v(S)    0     0     1      7       5       3       10

(b) Make a graphical representation of the sets $X_0, X_1, \ldots$

19.13. Nucleolus of a Two-Person Game

Compute the pre-nucleolus of a two-person game and the nucleolus of an essential
two-person game.

19.14. Individual Rationality Restrictions for the Nucleolus

Compute the nucleolus and the pre-nucleolus of the three-person TU-game defined
by $v(12) = v(13) = 2$, $v(123) = 1$ and $v(S) = 0$ for all other coalitions.

19.15. Example 19.7

Verify that $(32.5, 27.5, 25, 15)$ is indeed the pre-nucleolus of the game in
Example 19.7, by applying Theorem 19.5.

19.16. (Pre-)Nucleolus of a Symmetric Game

Let $v$ be an essential game. Suppose that $v$ is symmetric (meaning that there exists
a function $f: \mathbb{R} \to \mathbb{R}$ such that $v(S) = f(|S|)$ for every coalition $S$).

(a) Prove that the (pre-)nucleolus is symmetric, that is, $\nu(v)_i = \nu(v)_j$ and $\nu^*(v)_i =
\nu^*(v)_j$ for all players $i, j \in N$. Give a formula for the (pre-)nucleolus.

(b) Suppose that $\mathcal{D}(\alpha, \nu(v), v) \neq \{N\}$, where $\alpha$ is the maximal excess at the (pre-)
nucleolus. Prove that $X_1$ (cf. Theorem 19.1) is a singleton set.

19.17. COV and AN of the Pre-nucleolus

Prove that the pre-nucleolus satisfies COV and AN.

19.18. Apex Game

Consider the 5-person apex game $(N, v)$ with $N = \{1, 2, 3, 4, 5\}$ and $v(S) = 1$
if $1 \in S$ and $|S| \geq 2$ or if $|S| \geq 4$, and $v(S) = 0$ otherwise. Compute the (pre-)
nucleolus of this game.

19.19. Landlord Game

Consider the landlord game in Problem 18.4.

(a) Assume that for all $i = 1, \ldots, n - 1$ we have $f(i) - f(i - 1) \geq f(i + 1) - f(i)$.
Show that the (pre-)nucleolus of this game assigns $f(n) - \frac{n}{2}[f(n) - f(n - 1)]$ to
the landlord. Compare with the Shapley value.

(b) Assume that for all $i = 1, \ldots, n - 1$ we have $f(i) - f(i - 1) \leq f(i + 1) - f(i)$, and
that $\frac{1}{n+1} f(n) \leq \frac{1}{2}[f(n) - f(n - 1)]$. Show that the (pre-)nucleolus of this game
treats all players (including the landlord) equally.

19.20. Game in Sect. 19.1

Use the algorithm of solving successive linear programs to find the (pre-)nucleolus
of the game discussed in Sect. 19.1. Use Theorem 19.5 to verify that the (pre-)
nucleolus has been found.

19.21. The Prekernel

For a game $(N, v)$ define the pre-kernel $\mathcal{K}^*(N, v) \subseteq I^*(N, v)$ by

$$\mathcal{K}^*(N, v) = \Big\{x \in I^*(N, v) \;\Big|\; \max_{S \subseteq N \setminus \{j\},\ i \in S} e(S, x, v) = \max_{S \subseteq N \setminus \{i\},\ j \in S} e(S, x, v) \text{ for all } i, j \in N\Big\}.$$

Prove that $\nu^*(N, v) \in \mathcal{K}^*(N, v)$.


19.8 Notes

The nucleolus was introduced in Schmeidler (1969). The treatment in the present
chapter is partially based on the treatment of the subject in Peleg and Sudhölter
(2003) and Owen (1995).

Theorem 19.5 is due to Kohlberg (1971). Example 19.7 is taken from Owen
(1995).

The Davis-Maschler reduced game stems from Davis and Maschler (1965).

The characterization of the pre-nucleolus in Theorem 19.11 is due to Sobolev
(1975). The reader may also consult Peleg and Sudhölter (2003) for a complete
proof of the theorem. Snijders (1995) provides a characterization of the nucleolus
on the class of all games with non-empty imputation set by modifying the
Davis-Maschler consistency condition.

For the landlord game in Problem 19.19 see also Moulin (1988).


In this chapter we consider a few classes of games with transferable utility which
are  derived  from  specific  economic  (or  political)  models  or  combinatorial  problems. 
In  particular,  we  study  assignment  and  permutation  games,  flow  games,  and  voting 
games. 


20.1  Assignment  and  Permutation  Games 

An  example  of  a  permutation  game  is  the  ‘dentist  game’  described  in  Sect.  1.3.4.  An 
example  of  an  assignment  game  is  the  following. 

Example  20.1  Vladimir  (player  1),  Wanda  (player  2),  and  Xavier  (player  3)  each 
own  a  house  that  they  want  to  sell.  Yolanda  (player  4)  and  Zarik  (player  5)  each 
want  to  buy  a  house.  Vladimir,  Wanda,  and  Xavier  value  their  houses  at  1,  1.5,  and 
2,  respectively  (each  unit  is  100,000  Euros).  The  worths  of  their  houses  to  Yolanda 
and  Zarik,  respectively,  are  0.8  and  1.5  for  Vladimir’s  house,  2  and  1.2  for  Wanda’s 
house,  and  2.2  and  2.3  for  Xavier’s  house. 

This  situation  gives  rise  to  a  five-player  TU-game,  where  the  worth  of  each 
coalition  is  defined  to  be  the  maximal  surplus  that  can  be  generated  by  buying  and 
selling  within  the  coalition.  For  instance,  in  the  coalition  {2,3,5}  the  maximum 
surplus is generated if Zarik buys the house of Xavier, namely 2.3 - 2 = 0.3, which
is greater than the 1.2 - 1.5 = -0.3 that results if Zarik buys Wanda's house. Each
coalition  can  generate  a  payoff  of  at  least  0  because  it  can  refrain  from  trading  at  all. 
The  complete  game  is  described  in  Table  20.1,  where  coalitions  with  only  buyers  or 
only  sellers  are  left  out.  A  game  like  this  is  called  an  assignment  game.  □ 

We  will  examine  such  games  in  detail,  starting  with  the  basic  definitions. 

Let $M$ and $P$ be two finite, disjoint sets. For each pair $(i, j) \in M \times P$ the number
$a_{ij} \geq 0$ is interpreted as the value of the matching between $i$ and $j$. With this situation
a cooperative game $(N, v)$ can be associated, as follows. The player set $N$ is the set
$M \cup P$. For each coalition $S \subseteq N$ the worth $v(S)$ is the maximum that $S$ can achieve by
making pairs among its own members. Formally, if $S \subseteq M$ or $S \subseteq P$ then $v(S) = 0$,
because no pairs can be formed at all. Otherwise, $v(S)$ is equal to the value of the
following integer programming problem.

$$\begin{array}{lll}
\max & \sum_{i \in M} \sum_{j \in P} a_{ij} x_{ij} & \\
\text{subject to} & \sum_{j \in P} x_{ij} \leq 1_S(i) & \text{for all } i \in M \\
& \sum_{i \in M} x_{ij} \leq 1_S(j) & \text{for all } j \in P \\
& x_{ij} \in \{0, 1\} & \text{for all } i \in M,\ j \in P.
\end{array} \qquad (20.1)$$

Here, $1_S(i) := 1$ if $i \in S$ and equal to zero otherwise. Games defined by (20.1) are
called assignment games. The reader may verify that in Example 20.1 the numbers
$a_{ij}$ are given by $a_{ij} = \max\{h_{ij} - c_i, 0\}$, where $h_{ij}$ is the value of the house of player $i$
to player $j$ and $c_i$ is the value of the house of player $i$ for himself.

Table 20.1 Worths for the assignment game in Example 20.1

S      v(S)     S      v(S)     S        v(S)
14     0        125    0.5      345      0.3
15     0.5      134    0.2      1234     0.5
24     0.5      135    0.5      1235     0.5
25     0        145    0.5      1245     1
34     0.2      234    0.5      1345     0.7
35     0.3      235    0.3      2345     0.8
124    0.5      245    0.5      12345    1
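
The worths in Table 20.1 can be reproduced by brute force. The following Python sketch is an illustration that is not part of the original text: instead of solving the integer program (20.1) with a solver, it simply enumerates all matchings inside a coalition, which is feasible for the five players of Example 20.1. Amounts are expressed in units of 10,000 Euros (an assumption made here to keep the arithmetic in integers), and the names `own_value`, `buyer_value` and `worth` are hypothetical.

```python
from itertools import combinations, permutations

sellers, buyers = (1, 2, 3), (4, 5)
# Data of Example 20.1 in units of 10,000 Euros (so 10 means 100,000 Euros).
own_value = {1: 10, 2: 15, 3: 20}
buyer_value = {(1, 4): 8, (1, 5): 15, (2, 4): 20,
               (2, 5): 12, (3, 4): 22, (3, 5): 23}
# a_ij = max{h_ij - c_i, 0}: surplus if buyer j gets the house of seller i.
a = {(i, j): max(buyer_value[i, j] - own_value[i], 0)
     for i in sellers for j in buyers}


def worth(S):
    """Maximal total surplus over all seller-buyer matchings within S."""
    m = [i for i in sellers if i in S]
    p = [j for j in buyers if j in S]
    best = 0  # refraining from trade is always possible
    for r in range(1, min(len(m), len(p)) + 1):
        for chosen in combinations(m, r):
            for partners in permutations(p, r):
                best = max(best, sum(a[i, j] for i, j in zip(chosen, partners)))
    return best


for S in [{2, 3, 5}, {1, 3, 4, 5}, {2, 3, 4, 5}, {1, 2, 3, 4, 5}]:
    # Divide by 10 to return to the units of the table (100,000 Euros).
    print(sorted(S), worth(S) / 10)
# [2, 3, 5] 0.3
# [1, 3, 4, 5] 0.7
# [2, 3, 4, 5] 0.8
# [1, 2, 3, 4, 5] 1.0
```

Enumeration grows factorially with the number of players, so for larger markets one would solve (20.1) with a proper assignment algorithm instead.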

As will become clear below, a more general situation is the following. For each
$i \in N = \{1, 2, \ldots, n\}$ let $k_{i\pi(i)}$ be the value placed by player $i$ on the permutation
$\pi \in \Pi(N)$. (The implicit assumption is that $k_{i\pi(i)} = k_{i\sigma(i)}$ whenever $\pi(i) = \sigma(i)$.)
Each coalition $S \subseteq N$ may achieve a permutation $\pi$ involving only the players of $S$,
that is, $\pi(i) = i$ for all $i \notin S$. Let $\Pi(S)$ denote the set of all such permutations. Then
a game $v$ results by defining, for each nonempty coalition $S$, the worth by

$$v(S) := \max_{\pi \in \Pi(S)} \sum_{i \in S} k_{i\pi(i)}. \qquad (20.2)$$

The game thus obtained is called a permutation game. Alternatively, the worth $v(S)$
in such a game can be defined by the following integer programming problem.

$$\begin{array}{lll}
\max & \sum_{i \in N} \sum_{j \in N} k_{ij} x_{ij} & \\
\text{subject to} & \sum_{j \in N} x_{ij} = 1_S(i) & \text{for all } i \in N \\
& \sum_{i \in N} x_{ij} = 1_S(j) & \text{for all } j \in N \\
& x_{ij} \in \{0, 1\} & \text{for all } i, j \in N.
\end{array} \qquad (20.3)$$

The two definitions are equivalent, and both can be used to verify that the 'dentist
game' of Sect. 1.3.4 is indeed a permutation game (Problem 20.1).

The relation between the class of assignment games and the class of permutation
games is a simple one. The former class is contained in the latter, as the following
theorem shows.

Theorem 20.2 Every assignment game is a permutation game.

Proof Let $v$ be an assignment game with player set $N = M \cup P$. For all $i, j \in N$
define

$$k_{ij} := \begin{cases}
a_{ij} & \text{if } i \in M,\ j \in P \\
0 & \text{otherwise.}
\end{cases}$$

Let $w$ be the permutation game defined by (20.3) with $k_{ij}$ as above. Note that the
number of variables in the integer programming problem defining $v(S)$ is $|M| \times |P|$,
while the number of variables in the integer programming problem defining $w(S)$
is $(|M| + |P|)^2$. For $S \subseteq M$ or $S \subseteq P$, $w(S) = 0 = v(S)$. Let now $S \subseteq N$ with
$S \not\subseteq M$ and $S \not\subseteq P$. Let $x \in \{0, 1\}^{|M| \times |P|}$ be an optimal solution for (20.1). Define
$\hat{x} \in \{0, 1\}^{(|M| + |P|)^2}$ by

$$\begin{array}{ll}
\hat{x}_{ij} := x_{ij} & \text{if } i \in M,\ j \in P \\
\hat{x}_{ij} := x_{ji} & \text{if } i \in P,\ j \in M \\
\hat{x}_{ii} := 1_S(i) - \sum_{j \in P} x_{ij} & \text{if } i \in M \\
\hat{x}_{jj} := 1_S(j) - \sum_{i \in M} x_{ij} & \text{if } j \in P \\
\hat{x}_{ij} := 0 & \text{in all other cases.}
\end{array}$$

Then $\hat{x}$ satisfies the conditions in problem (20.3). Hence, for every $S$,

$$w(S) \geq \sum_{i \in N} \sum_{j \in N} k_{ij} \hat{x}_{ij} = \sum_{i \in M} \sum_{j \in P} a_{ij} x_{ij} = v(S).$$

On the other hand, let $z \in \{0, 1\}^{(|M| + |P|)^2}$ be an optimal solution for (20.3). Define
$\hat{z} \in \{0, 1\}^{|M| \times |P|}$ by

$$\hat{z}_{ij} := z_{ij} \text{ for } i \in M,\ j \in P.$$

Then $\hat{z}$ satisfies the conditions in problem (20.1). Hence, for every $S$,

$$v(S) \geq \sum_{i \in M} \sum_{j \in P} a_{ij} \hat{z}_{ij} = \sum_{i \in M} \sum_{j \in P} k_{ij} z_{ij} = w(S).$$

Consequently, $v = w$. ■

The converse of Theorem 20.2 is not true, as the following example shows. As a
matter of fact, a necessary condition for a permutation game to be an assignment
game is the existence of a partition of the player set $N$ of the permutation game into
two subsets $N_1$ and $N_2$, such that the value of a coalition $S$ is 0 whenever $S \subseteq N_1$ or
$S \subseteq N_2$. The example shows that this is not a sufficient condition.

Example 20.3 Let $N = \{1, 2, 3\}$ and let $v$ be the permutation game with the
numbers $k_{ij}$ given in the following matrix:

$$\begin{pmatrix}
0 & 2 & 1 \\
1 & 0 & 0 \\
2 & 0 & 0
\end{pmatrix}$$

Then $v(i) = 0$ for every $i \in N$, $v(1, 2) = v(1, 3) = 3$, $v(2, 3) = 0$, and $v(N) = 4$.
Note that this game satisfies the condition formulated above with $N_1 = \{1\}$ and
$N_2 = \{2, 3\}$, but it is not an assignment game (Problem 20.2). □
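
The worths in Example 20.3 follow directly from definition (20.2). As an illustration that is not part of the original text, the following Python sketch evaluates (20.2) by enumerating all permutations of the members of a coalition; the function name `worth` is hypothetical.

```python
from itertools import combinations, permutations

# Matrix of Example 20.3: k[i-1][j-1] is the value k_ij.
k = [[0, 2, 1],
     [1, 0, 0],
     [2, 0, 0]]
n = len(k)


def worth(S):
    """v(S) as in (20.2): best permutation that moves only players in S."""
    members = sorted(S)
    return max(sum(k[i - 1][j - 1] for i, j in zip(members, image))
               for image in permutations(members))


game = {S: worth(S) for r in range(1, n + 1)
        for S in combinations(range(1, n + 1), r)}
for S, value in game.items():
    print(S, value)
# (1,) 0
# (2,) 0
# (3,) 0
# (1, 2) 3
# (1, 3) 3
# (2, 3) 0
# (1, 2, 3) 4
```

The grand coalition attains its worth 4 with the cyclic permutation $1 \to 2 \to 3 \to 1$, since $k_{12} + k_{23} + k_{31} = 2 + 0 + 2$.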

The main purpose of this section is to show that permutation games and, hence,
assignment games are balanced and, in fact, totally balanced. A TU-game $(N, v)$
is totally balanced if the subgame $(M, v)$, where $v$ is the restriction to $M$, is
balanced for every $M \subseteq N$. Balanced games are exactly those games that have a
non-empty core, see Chap. 16.

Theorem  20.4  Assignment  games  and  permutation  games  are  totally  balanced. 

Proof  In  view  of  Theorem  20.2,  it  is  sufficient  to  prove  that  permutation  games 
are  totally  balanced.  Because  any  subgame  of  a  permutation  game  is  again  a 
permutation  game  (see  Problem  20.3),  it  is  sufficient  to  prove  that  any  permutation 
game  is  balanced. 

Let $(N, v)$ be a permutation game, defined by (20.3). By the Birkhoff-von
Neumann Theorem (Theorem 22.12) the integer restriction can be dropped so that
each $v(S)$ is also defined by the following program:

$$\begin{array}{lll}
\max & \sum_{i \in N} \sum_{j \in N} k_{ij} x_{ij} & \\
\text{subject to} & \sum_{j \in N} x_{ij} = 1_S(i) & \text{for all } i \in N \\
& \sum_{i \in N} x_{ij} = 1_S(j) & \text{for all } j \in N \\
& x_{ij} \geq 0 & \text{for all } i, j \in N.
\end{array} \qquad (20.4)$$

Note that this is a linear programming problem of the same format as the
maximization problem in Theorem 16.20. Namely, with notations as there, take

$$\begin{aligned}
y &= (x_{11}, \ldots, x_{1n}, x_{21}, \ldots, x_{2n}, \ldots, x_{n1}, \ldots, x_{nn}) \\
b &= (k_{11}, \ldots, k_{1n}, k_{21}, \ldots, k_{2n}, \ldots, k_{n1}, \ldots, k_{nn}) \\
c &= (1_S, 1_S).
\end{aligned}$$

Further, let $A$ be the $2n \times n^2$-matrix with row $i \in \{1, \ldots, n\}$ containing a 1 at
columns $i, i + n, i + 2n, \ldots, i + (n - 1)n$ and zeros otherwise; and with row $i + n$
($i \in \{1, \ldots, n\}$) containing a 1 at columns $(i - 1)n + 1, \ldots, in$ and zeros otherwise.
The corresponding dual problem, the minimization problem in Theorem 16.20, then
has the form:

$$\begin{array}{ll}
\min & \sum_{i \in N} 1_S(i) y_i + \sum_{j \in N} 1_S(j) z_j \\
\text{subject to} & y_i + z_j \geq k_{ij} \text{ for all } i, j \in N.
\end{array} \qquad (20.5)$$

Let $(\hat{y}, \hat{z})$ be an optimal solution of problem (20.5) for $S = N$. Then, by
Theorem 16.20 and the fact that the maximum in problem (20.4) for $S = N$ is
equal to $v(N)$ by definition, it follows that

$$\sum_{i \in N} (\hat{y}_i + \hat{z}_i) = v(N).$$

Since $(\hat{y}, \hat{z})$ satisfies the restrictions in problem (20.5) for every $S \subseteq N$, it
furthermore holds that for every $S \subseteq N$,

$$\sum_{i \in S} (\hat{y}_i + \hat{z}_i) = \sum_{i \in N} 1_S(i) \hat{y}_i + \sum_{i \in N} 1_S(i) \hat{z}_i \geq v(S).$$

Therefore, $u \in \mathbb{R}^N$ defined by $u_i := \hat{y}_i + \hat{z}_i$ is in the core of $v$. ■
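
The construction in this proof can be illustrated on the permutation game of Example 20.3. The following Python sketch is not part of the original text, and it does not solve the dual program (20.5): the vectors `y` and `z` below were found by inspection and are an assumption of this sketch. The code only checks that they are feasible for (20.5), that their objective value for $S = N$ equals $v(N)$ (so that they are optimal by duality), and that $u_i = y_i + z_i$ lies in the core.

```python
from itertools import combinations, permutations

k = [[0, 2, 1],
     [1, 0, 0],
     [2, 0, 0]]
players = (1, 2, 3)


def worth(S):
    """v(S) as in (20.2), by enumerating permutations of the members of S."""
    members = sorted(S)
    return max(sum(k[i - 1][j - 1] for i, j in zip(members, image))
               for image in permutations(members))


# Candidate dual solution, chosen by hand (assumption of this sketch).
y = {1: 1, 2: 0, 3: 1}
z = {1: 1, 2: 1, 3: 0}

# Feasibility for (20.5): y_i + z_j >= k_ij for all i, j.
dual_feasible = all(y[i] + z[j] >= k[i - 1][j - 1]
                    for i in players for j in players)
# Dual objective for S = N, to be compared with v(N).
dual_value = sum(y.values()) + sum(z.values())

u = {i: y[i] + z[i] for i in players}
in_core = sum(u.values()) == worth(players) and all(
    sum(u[i] for i in S) >= worth(S)
    for r in (1, 2) for S in combinations(players, r))

print(dual_feasible, dual_value, worth(players))   # True 4 4
print(u, in_core)   # {1: 2, 2: 1, 3: 1} True
```

Other optimal dual solutions exist and lead to other core elements, so $(2, 1, 1)$ is just one point of the core obtained in this way.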


20.2  Flow  Games 

In  this  section  another  class  of  balanced  games  is  considered.  These  games  are 
derived  from  the  following  kind  of  situation.  There  is  a  given  capacitated  network, 
the  edges  of  which  are  controlled  by  subsets  of  players.  These  coalitions  can  send 
a  flow  through  the  network.  The  flow  is  maximal  if  all  players  cooperate,  and  then 
the  question  arises  how  to  distribute  the  profits.  One  can  think  of  an  almost  literal 
example,  where  the  edges  represent  oil  pipelines,  and  the  players  are  in  power  in 
different  countries  through  which  these  pipelines  cross.  Alternatively,  one  can  think 
of  rail  networks  between  cities,  or  information  channels  between  different  users. 
[Fig. 20.1  Example 20.5]

Capacitated  networks  are  treated  in  Sect.  22.7,  which  the  reader  may  consult 
before  continuing. 

Consider a capacitated network $(V, E, k)$ and a set of players $N = \{1, \ldots, n\}$. Suppose that with each edge in $E$ a simple game (cf. Sect. 16.3) is associated. The winning coalitions in this simple game are supposed to control the corresponding edge; the capacitated network is called a controlled capacitated network. For any coalition $S \subseteq N$ consider the capacitated network arising from the given network by deleting the edges that are not controlled by $S$. A game can be defined by letting the worth of $S$ be equal to the value of a maximal flow through this restricted network. The game thus arising is called a flow game.

Example 20.5 Consider the capacitated network in Fig. 20.1. This network has three edges denoted $e_1$, $e_2$ and $e_3$ with capacities 4, 5 and 10, respectively. The control games are $w_1, w_2, w_3$ with $N = \{1, 2, 3\}$ and

$w_1(S) = 1$ if $S \in \{\{1, 2\}, N\}$ and $w_1(S) = 0$ otherwise

$w_2(S) = 1$ if $S \in \{\{1, 3\}, N\}$ and $w_2(S) = 0$ otherwise

$w_3(S) = 1$ if, and only if, $1 \in S$.

The coalition $\{1, 2\}$ can only use the edges $e_1$ and $e_3$, so the maximal flow (per time unit) for $\{1, 2\}$ is 4. This results in $v(\{1, 2\}) = 4$ for the corresponding flow game $(N, v)$. The complete game is given by $v(i) = 0$ for all $i \in N$, $v(\{1, 2\}) = 4$, $v(\{1, 3\}) = 5$, $v(\{2, 3\}) = 0$ and $v(N) = 9$.

A minimum cut in this network corresponding to the grand coalition is $(\{q\}, V \setminus \{q\})$. By the Max Flow Min Cut Theorem of Ford and Fulkerson (Theorem 22.16), the sum of the capacities of $e_1$ and $e_2$ ($4 + 5$) is equal to $v(N)$. Divide $v(N)$ as follows. Divide 4 equally among the veto players of $w_1$, and 5 equally among the veto players of $w_2$. The result for the players is the payoff vector $(4\tfrac{1}{2}, 2, 2\tfrac{1}{2})$. Note that this vector is in $C(v)$. □
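The numbers in Example 20.5 can be reproduced with a few lines of Python. The sketch below rests on an assumption about the figure, which is not reproduced here: edges $e_1$ and $e_2$ are taken to run in parallel from the source $q$ to an intermediate vertex, and $e_3$ from that vertex to the sink. This layout is consistent with the worths stated in the example, but it is inferred, not taken from the figure. All function names are illustrative.

```python
from itertools import combinations

PLAYERS = (1, 2, 3)
CAPACITY = {"e1": 4, "e2": 5, "e3": 10}

def controls(edge, S):
    """Control games w1, w2, w3 of Example 20.5."""
    S = set(S)
    if edge == "e1":
        return {1, 2} <= S
    if edge == "e2":
        return {1, 3} <= S
    return 1 in S                      # edge e3

def v(S):
    """Maximal flow for S under the ASSUMED layout: e1 and e2 in parallel
    from the source to a middle vertex, e3 from the middle vertex to the sink."""
    into_middle = sum(CAPACITY[e] for e in ("e1", "e2") if controls(e, S))
    out_of_middle = CAPACITY["e3"] if controls("e3", S) else 0
    return min(into_middle, out_of_middle)

coalitions = [S for r in range(1, 4) for S in combinations(PLAYERS, r)]
x = {1: 4.5, 2: 2.0, 3: 2.5}           # payoff vector proposed in the example
in_core = (sum(x.values()) == v(PLAYERS)
           and all(sum(x[i] for i in S) >= v(S) for S in coalitions))
print({S: v(S) for S in coalitions}, in_core)
```

For a general network one would replace `v` by a proper maximum-flow routine; the closed form above only works because of the simple series-parallel layout assumed here.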

The  next  theorem  shows  that  the  non-emptiness  of  the  core  of  the  control  games  is 
inherited  by  the  flow  game. 

Theorem  20.6  Suppose  all  control  games  in  a  controlled  capacitated  network  have 
veto  players.  Then  the  corresponding  flow  game  is  balanced. 
Proof Take a maximal flow for the grand coalition and a minimum cut in the network for the grand coalition, consisting of the edges

$e_1, e_2, \ldots, e_p$ with capacities $k_1, k_2, \ldots, k_p$

and control games $w_1, w_2, \ldots, w_p$, respectively. Then Theorem 22.16 implies that $v(N) = \sum_{r=1}^{p} k_r$. For each $r$ take $x^r \in C(w_r)$ and divide $k_r$ according to the division key $x^r$ (i.e., $k_r x^r_i$ is the amount for player $i$). Note that non-veto players get nothing. Then $\sum_{r=1}^{p} k_r x^r \in C(v)$. To see this, first note that

\[
\sum_{i=1}^{n} \sum_{r=1}^{p} k_r x^r_i = \sum_{r=1}^{p} k_r \sum_{i=1}^{n} x^r_i = \sum_{r=1}^{p} k_r = v(N).
\]

Next, for each coalition $S$, the set

\[
E_S := \{e_r : r \in \{1, \ldots, p\},\ w_r(S) = 1\}
\]

is associated with a cut of the network, governed by the coalition $S$. Hence,

\[
\sum_{i \in S} \Big( \sum_{r=1}^{p} k_r x^r_i \Big) = \sum_{r=1}^{p} k_r \sum_{i \in S} x^r_i \ge \sum_{r=1}^{p} k_r w_r(S) = \sum_{e_r \in E_S} k_r = \text{capacity}(E_S) \ge v(S),
\]

where the last inequality follows from Theorem 22.16. ■
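The construction in this proof is easy to mechanize. The following Python sketch is illustrative only: the function name is hypothetical, and as division key $x^r$ it uses the equal split among the veto players of $w_r$, which is just one of the possible choices in $C(w_r)$. It is applied to the minimum cut $\{e_1, e_2\}$ of Example 20.5.

```python
from fractions import Fraction

def core_element_from_cut(cut_edges, players):
    """cut_edges: list of (capacity k_r, set of veto players of w_r).

    Each capacity k_r is split equally among the veto players of w_r;
    non-veto players receive nothing from that edge."""
    payoff = {i: Fraction(0) for i in players}
    for capacity, veto_players in cut_edges:
        share = Fraction(capacity, len(veto_players))
        for i in veto_players:
            payoff[i] += share
    return payoff

# Minimum cut of Example 20.5: e1 (capacity 4, veto players 1 and 2)
# and e2 (capacity 5, veto players 1 and 3).
cut = [(4, {1, 2}), (5, {1, 3})]
x = core_element_from_cut(cut, players=(1, 2, 3))
print(x)   # payoffs 9/2, 2 and 5/2 for players 1, 2 and 3
```

Other division keys (for instance, giving a whole capacity to a single veto player) yield other core elements in the same way.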


The  next  theorem  is  a  partial  converse  to  Theorem  20.6. 


Theorem 20.7 Each nonnegative balanced game arises from a controlled capacitated network where all control games possess veto players.

Proof See Problem 20.5.


20.3  Voting  Games:  The  Banzhaf  Value 

Voting  games  constitute  another  special  class  of  TU-games.  Voting  games  are  simple 
games  which  reflect  the  distribution  of  voting  power  within,  for  instance,  political 
systems.  There  is  a  large  body  of  work  on  voting  games  within  the  political  science 
literature.  In  this  section  we  restrict  ourselves  to  a  brief  discussion  of  a  well-known 
example of a power index, the so-called Banzhaf-Coleman index, and the associated
value, the Banzhaf value.

A power index is a value applied to voting (simple) games. The payoff vector assigned to a game is interpreted as reflecting power distribution (e.g., the probability of having a decisive vote) rather than utility.

We start with an illustrative example.
Example 20.8 Consider a parliament with three parties 1, 2, and 3. The numbers of votes are, respectively, 50, 30, and 20. To pass any law, a two-thirds majority is needed. This leads to a simple game with winning coalitions $\{1, 2\}$, $\{1, 3\}$, and $\{1, 2, 3\}$. The Shapley value of this game is $(\tfrac{2}{3}, \tfrac{1}{6}, \tfrac{1}{6})$, as can easily be checked. By definition of the Shapley value this means that in four of the six orderings player 1 makes the coalition of his predecessors winning by joining them, whereas for players 2 and 3 this is only the case in one ordering each. The coalitions that are made winning by player 1 if he joins are $\{2\}$, $\{3\}$, and $\{2, 3\}$. In the Shapley value the last coalition is counted double. It might be more natural to count this coalition only once. This would lead to an outcome $(\tfrac{3}{5}, \tfrac{1}{5}, \tfrac{1}{5})$, instead of the Shapley value. The associated value is called the normalized Banzhaf-Coleman index. □
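Both counts in Example 20.8 can be verified by brute force. The Python sketch below is a minimal illustration; the function and variable names are made up for this purpose. It counts, for each party, the orderings in which the party turns its predecessors into a winning coalition (Shapley value), and the coalitions it makes winning when each such coalition is counted once.

```python
from fractions import Fraction
from itertools import combinations, permutations

VOTES = {1: 50, 2: 30, 3: 20}          # votes of parties 1, 2, 3
PLAYERS = tuple(VOTES)

def wins(coalition):
    """A coalition wins if it holds at least two thirds of all votes."""
    return 3 * sum(VOTES[i] for i in coalition) >= 2 * sum(VOTES.values())

def shapley():
    """Share of orderings in which a player makes the predecessors winning."""
    orders = list(permutations(PLAYERS))
    pivots = {i: 0 for i in PLAYERS}
    for order in orders:
        for pos, i in enumerate(order):
            if not wins(order[:pos]) and wins(order[:pos + 1]):
                pivots[i] += 1
    return {i: Fraction(pivots[i], len(orders)) for i in PLAYERS}

def made_winning_by(i):
    """Losing coalitions without i that become winning once i joins."""
    others = [j for j in PLAYERS if j != i]
    return [S for r in range(len(others) + 1)
            for S in combinations(others, r)
            if not wins(S) and wins(S + (i,))]

counts = {i: len(made_winning_by(i)) for i in PLAYERS}
normalized_index = {i: Fraction(counts[i], sum(counts.values())) for i in PLAYERS}
print(shapley(), normalized_index)
```

Player 1 makes three coalitions winning and players 2 and 3 one each, which gives the shares 3/5, 1/5, 1/5 stated in the example.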

For a simple game $(N, v)$ (see Sect. 16.3), the normalized Banzhaf-Coleman index can be defined as follows. Define a swing for player $i$ as a coalition $S \subseteq N$ such that $i \in S$, $S$ wins, and $S \setminus \{i\}$ loses. Let $\theta_i$ be the number of swings for player $i$, and define the numbers

\[
\beta_i(N, v) := \frac{\theta_i}{\sum_{j=1}^{n} \theta_j}.
\]

The vector $\beta(N, v)$ is the normalized Banzhaf-Coleman index of the simple game $(N, v)$.

For a general game $(N, v)$ write

\[
\theta_i(v) := \sum_{S \subseteq N:\ i \notin S} [v(S \cup i) - v(S)].
\]

For a simple game $v$ this number $\theta_i(v)$ coincides with the number $\theta_i$ above.

Next, define the value $\Psi : \mathcal{G}^N \to \mathbb{R}^N$ by

\[
\Psi_i(v) := \frac{\theta_i(v)}{2^{|N|-1}} = \sum_{S \subseteq N:\ i \notin S} \frac{1}{2^{|N|-1}} [v(S \cup i) - v(S)].
\tag{20.6}
\]

The value $\Psi$ is called the Banzhaf value. The remainder of this section is devoted to an axiomatic characterization of this value. This characterization uses the axioms SYM (Symmetry), SMON (Strong Monotonicity), and DUM (Dummy Property), which were all introduced in Chap. 17. Besides, it uses an 'amalgamation' property, as follows.


1 Also called the Shapley-Shubik power index in this context.
For a game $(N, v)$ (with at least two players) and different players $i, j$ put $p = \{i, j\}$ and define the game $((N \setminus p) \cup \{p\}, v_p)$ by

\[
v_p(S) = v(S) \ \text{ and } \ v_p(S \cup \{p\}) = v(S \cup p), \ \text{ for any } S \subseteq N \setminus p.
\tag{20.7}
\]

Thus, $v_p$ is an $(n-1)$-person game obtained by amalgamating players $i$ and $j$ in $v$ into one player $p$ in $v_p$.

Let $\psi$ be an arbitrary value (on the class $\mathcal{G}$ of all games with arbitrary player set). The announced axiom is as follows.

2-Efficiency (2-EFF): $\psi_i(v) + \psi_j(v) = \psi_p(v_p)$ for all $v, i, j, p, v_p$ as above.
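To make the amalgamation operation concrete, the following Python sketch computes the Banzhaf value (20.6), builds the amalgamated game of (20.7), and checks 2-EFF for the Banzhaf value on the parliament game of Example 20.8. It is a sketch under stated assumptions: games are represented as Python functions on frozensets of players, and all names (`banzhaf`, `amalgamate`, `parliament`) are illustrative.

```python
from fractions import Fraction
from itertools import combinations

def subsets(players):
    players = tuple(players)
    return [frozenset(S) for r in range(len(players) + 1)
            for S in combinations(players, r)]

def banzhaf(players, v):
    """Banzhaf value (20.6); v maps frozensets of players to worths."""
    value = {}
    for i in players:
        others = [j for j in players if j != i]
        total = sum(v(S | {i}) - v(S) for S in subsets(others))
        value[i] = Fraction(total, 2 ** (len(players) - 1))
    return value

def amalgamate(players, v, i, j):
    """Game v_p of (20.7): i and j are merged into one player p = {i, j}."""
    p = frozenset({i, j})
    new_players = tuple(k for k in players if k not in p) + (p,)
    def v_p(S):
        S = frozenset(S)
        return v((S - {p}) | p) if p in S else v(S)
    return new_players, v_p, p

def parliament(S):
    """Simple game of Example 20.8: winning iff S contains {1,2} or {1,3}."""
    return 1 if 1 in S and (2 in S or 3 in S) else 0

players = (1, 2, 3)
psi = banzhaf(players, parliament)
merged_players, v_p, p = amalgamate(players, parliament, 2, 3)
psi_p = banzhaf(merged_players, v_p)
print(psi, psi_p[p], psi[2] + psi[3] == psi_p[p])
```

Merging players 2 and 3 gives a two-person game in which the merged player obtains 1/2, which equals the sum of the Banzhaf payoffs 1/4 + 1/4 in the original game.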

The  following  theorem  gives  a  characterization  of  the  Banzhaf  value. 

Theorem 20.9 The value $\psi$ on $\mathcal{G}$ satisfies 2-EFF, SYM, DUM, and SMON, if and only if $\psi$ is the Banzhaf value $\Psi$.

Proof That the Banzhaf value satisfies the four axioms in the theorem is the subject of Problem 20.9. For the converse, let $\psi$ be a value satisfying the four axioms. We prove that $\psi = \Psi$.

Step 1

Let $u_T$ be a unanimity game. We first show that

\[
\psi_i(u_T) = 1/2^{|T|-1} \ \text{ if } i \in T \ \text{ and } \ \psi_i(u_T) = 0 \ \text{ if } i \notin T.
\tag{20.8}
\]

If $|T| = 1$ then every player is a dummy, so that (20.8) follows from DUM. Suppose (20.8) holds whenever $|T| \le k$ or $|N| \le m$, and consider a unanimity game $u_T$ where now the number of players is $m + 1$, and $T$ contains $k + 1$ players. Let $i, j \in T$, put $p = \{i, j\}$ and consider the game $(u_T)_p$. Then $(u_T)_p$ is the $m$-person unanimity game of the coalition $T' = (T \setminus p) \cup \{p\}$, and $|T'| = k$. By the induction hypothesis

\[
\psi_p((u_T)_p) = 1/2^{|T'|-1} = 1/2^{k-1}.
\]

By 2-EFF this implies

\[
\psi_i(u_T) + \psi_j(u_T) = 1/2^{k-1}.
\]

From this and SYM it follows that

\[
\psi_i(u_T) = 1/2^{k} = 1/2^{|T|-1}
\]

for all $i \in T$, and by DUM, $\psi_j(u_T) = 0$ for all $j \notin T$. Thus, $\psi$ is the Banzhaf value on unanimity games for any finite set of players. In the same way, one shows that this is true for any real multiple $c u_T$ of a unanimity game.
Step 2

For an arbitrary game $v$ write $v = \sum_{\emptyset \ne T \subseteq N} c_T u_T$, and let $\alpha(v)$ denote the number of nonzero coefficients in this representation. The proof will be completed by induction on the number $\alpha(v)$ and the number of players. For $\alpha(v) = 1$ Step 1 implies $\psi(v) = \Psi(v)$ independent of the number of players. Assume that $\psi(v) = \Psi(v)$ on any game $v$ with at most $n$ players, and also any game $v$ with $\alpha(v) \le k$ for some natural number $k$ and with $n + 1$ players, and let $v$ be a game with $n + 1$ players and with $\alpha(v) = k + 1$. There are $k + 1$ different nonempty coalitions $T_1, \ldots, T_{k+1}$ with

\[
v = \sum_{r=1}^{k+1} c_{T_r} u_{T_r},
\]

where all coefficients are nonzero. Let $T := T_1 \cap \ldots \cap T_{k+1}$. Because $k + 1 \ge 2$, it holds that $N \setminus T \ne \emptyset$. Assume $i \notin T$. Define the game $w$ by

\[
w = \sum_{r:\ i \in T_r} c_{T_r} u_{T_r}.
\]

Then $\alpha(w) \le k$ and $v(S \cup i) - v(S) = w(S \cup i) - w(S)$ for every coalition $S$ not containing player $i$. By SMON and the induction hypothesis it follows that $\psi_i(v) = \psi_i(w) = \Psi_i(w) = \Psi_i(v)$. Hence,

\[
\psi_i(v) = \Psi_i(v) \ \text{ for every } i \in N \setminus T.
\tag{20.9}
\]

Let $j \in T$ and $i \in N \setminus T$, put $p = \{i, j\}$, and consider the game $v_p$. Because the game $v_p$ has $n$ players the induction hypothesis implies

\[
\psi_p(v_p) = \Psi_p(v_p).
\tag{20.10}
\]

Applying 2-EFF to both $\psi$ and $\Psi$ yields

\[
\psi_p(v_p) = \psi_i(v) + \psi_j(v) \ \text{ and } \ \Psi_p(v_p) = \Psi_i(v) + \Psi_j(v).
\tag{20.11}
\]

Combining (20.9)-(20.11) implies $\psi_j(v) = \Psi_j(v)$ for every $j \in T$. Together with (20.9) this completes the induction argument, and therefore the proof. ■


20.4  Problems 

20.1.  The  Dentist  Game 

Show  that  (20.2)  and  (20.3)  are  equivalent,  and  use  each  of  these  to  verify  that  the 
dentist  game  of  Sect.  1.3.4  is  a  permutation  game. 
[Fig. 20.2  The network of Problem 20.4]


20.2.  Example  20.3 

Show  that  the  game  in  Example  20.3  is  not  an  assignment  game. 

20.3.  Subgames  of  Permutation  Games 

Prove  that  subgames  of  permutation  games  are  again  permutation  games.  Is  this  also 
true  for  assignment  games? 

20.4.  A  Flow  Game 

Consider the network in Fig. 20.2. Suppose that this is a controlled capacitated network with player set $N = \{1, 2, 3, 4\}$, suppose that all edges have capacity 1 and that $w_1 = \delta_1$, $w_2 = \delta_2$, $w_3 = \delta_3$ and $w_4(S) = 1$ iff $S \in \{\{3, 4\}, N\}$. [Here, $\delta_i$ is the simple game where a coalition is winning if, and only if, it contains player $i$.]

(a) Calculate the corresponding flow game $(N, v)$.

(b) Calculate $C(v)$.

(c)  The  proof  of  Theorem  20.6  describes  a  way  to  find  core  elements  by  looking  at 
minimum  cuts  and  dividing  the  capacities  of  edges  in  the  minimum  cut  in  some 
way  among  the  veto  players  of  the  corresponding  control  game.  Which  elements 
of  C(v)  can  be  obtained  in  this  way? 

20.5.  Every  Nonnegative  Balanced  Game  is  a  Flow  Game 

Prove  that  every  nonnegative  balanced  game  is  a  flow  game.  [Hint:  You  may  use 
the  following  result:  every  nonnegative  balanced  game  can  be  written  as  a  positive 
linear  combination  of  balanced  simple  games.] 

20.6.  On  Theorem  20.6  (1) 

(a)  Consider  a  controlled  capacitated  network  with  a  minimum  cut,  where  all 
control  games  corresponding  to  the  edges  in  this  minimum  cut  (connecting 
vertices  between  the  two  sets  in  the  cut)  have  veto  players.  Prove  that  the 
corresponding  flow  game  is  balanced. 

(b) Show that the flow game corresponding to Fig. 20.3, where the winning coalitions of $w_1$ are $\{1, 3\}$, $\{2, 4\}$ and $N = \{1, 2, 3, 4\}$, where the winning coalitions of $w_2$ are $\{1, 2\}$ and $N$ and of $w_3$ $\{3, 4\}$ and $N$, and where the capacities are 1, 10, 10 respectively, has a nonempty core. Note that there is no minimum cut where all control games have veto players.

[Fig. 20.3  The network of Problem 20.6]

[Fig. 20.4  The network of Problem 20.7]


20.7.  On  Theorem  20.6  (2) 

Prove that the two-person flow game corresponding to the controlled capacitated network of Fig. 20.4 has an empty core, where $w_1 = \delta_1$, $w_2 = \delta_2$, $w_3(S) = 1$ if $S \ne \emptyset$, and where the capacities of the edges are equal to 1.

20.8. Totally Balanced Flow Games

Let $(N, v)$ be the flow game corresponding to a controlled capacitated network where all control games are dictatorial games (games of the form $\delta_i$, see Problem 20.4). Prove that each subgame $(S, v_S)$ (where $v_S$ is the restriction of $v$ to $2^S$) has a nonempty core, i.e., that the game $(N, v)$ is totally balanced.

20.9.  If-Part  of  Theorem  20.9 

Prove  that  the  Banzhaf  value  satisfies  2-EFF,  SYM,  DUM,  and  SMON.  Is  it  possible 
to  weaken  DUM  to  NP  (the  null-player  property)  in  Theorem  20.9?  Give  an  example 
showing  that  the  Banzhaf  value  is  not  efficient. 


20.5  Notes 

The presentation in Sect. 20.1 is mainly based on Curiel (1997, Chap. 3). Example 20.1
is from this book. Assignment games were introduced by Shapley and
Shubik  (1972).  Permutation  games  were  introduced  by  Tijs  et  al.  (1984). 

Theorem  20.6  on  flow  games  is  due  to  Curiel  et  al.  (1986). 

In  the  literature  many  characterizations  of  power  indices  are  available.  The  one 
presented  in  Sect.  20.3  is  based  on  Nowak  (1997). 

For  the  result  in  the  hint  to  Problem  20.5  see  Derks  (1987).  Problem  20.8  refers 
to  Kalai  and  Zemel  (1982). 
21  Bargaining Problems

The  game-theoretic  literature  on  bargaining  can  be  divided  in  two  strands:  the 
cooperative  and  the  noncooperative  approach.  Here,  the  focus  is  on  the  cooperative 
approach,  which  was  initiated  by  Nash  (1950)  and  which  is  axiomatic  in  nature; 
see  Sect.  10.1  for  a  first  discussion.  A  seminal  article  on  noncooperative  bargaining 
is  Rubinstein  (1982).  The  basic  idea  of  that  paper  is  briefly  repeated  below,  but 
see  Sect.  6.7  for  a  more  elaborate  discussion.  We  conclude  the  chapter  with  a  few 
remarks  on  games  with  nontransferable  utility  (NTU-games). 


21.1  The Bargaining Problem

A 2-person bargaining problem is a pair $(S, d)$ where $S$ is a compact convex nonempty subset of $\mathbb{R}^2$ and $d$ is an element of $S$ such that $x > d$ for some $x \in S$. The elements of $S$ are called outcomes and $d$ is the disagreement outcome. The interpretation of such a problem $(S, d)$ is as follows. Two bargainers, 1 and 2, have to agree on some outcome $x \in S$, yielding utility $x_i$ to bargainer $i$. If they fail to reach such an agreement, they end up with the disagreement utilities $d = (d_1, d_2)$. $\mathcal{B}$ denotes the family of all 2-person bargaining problems.

A (bargaining) solution is a map $F : \mathcal{B} \to \mathbb{R}^2$ such that $F(S, d) \in S$ for all $(S, d) \in \mathcal{B}$. Nash (1950) proposed to characterize such a solution by requiring it to satisfy certain axioms. More precisely, he proposed the following axioms.

Weak Pareto Optimality (WPO): $F(S, d) \in W(S)$ for all $(S, d) \in \mathcal{B}$, where $W(S) := \{x \in S \mid \forall y \in \mathbb{R}^2 : y > x \Rightarrow y \notin S\}$ is the weakly Pareto optimal subset of $S$.


1  See  Fig.  10.2  for  an  illustration  of  these  axioms.  In  Sect.  10.1  the  stronger  Pareto  Optimality  is 
imposed  instead  of  Weak  Pareto  Optimality.  In  the  diagram — panel  (a) — that  does  not  make  a 
difference. 

2 The notation $y > x$ means $y_i > x_i$ for $i = 1, 2$.
Symmetry (SYM): $F_1(S, d) = F_2(S, d)$ for all $(S, d) \in \mathcal{B}$ that are symmetric, i.e., $d_1 = d_2$ and $S = \{(x_2, x_1) \in \mathbb{R}^2 \mid (x_1, x_2) \in S\}$.

Scale Covariance (SC): $F(aS + b, ad + b) = aF(S, d) + b$ for all $(S, d) \in \mathcal{B}$, where $b \in \mathbb{R}^2$, $a \in \mathbb{R}^2_{++}$, $ax := (a_1 x_1, a_2 x_2)$ for all $x \in \mathbb{R}^2$, $aS := \{ax \mid x \in S\}$, and $aS + b := \{ax + b \mid x \in S\}$.

Independence of Irrelevant Alternatives (IIA): $F(S, d) = F(T, e)$ for all $(S, d), (T, e) \in \mathcal{B}$ with $d = e$, $S \subseteq T$, and $F(T, e) \in S$.

Weak  Pareto  Optimality  says  that  it  should  not  be  possible  for  both  bargainers  to 
gain  with  respect  to  the  solution  outcome.  If  a  game  is  symmetric,  then  there  is 
no  way  to  distinguish  between  the  bargainers,  and  a  solution  should  not  do  that 
either:  that  is  what  Symmetry  requires.  Scale  Covariance  requires  the  solution  to  be 
covariant  under  positive  affine  transformations:  the  underlying  motivation  is  that  the 
utility  functions  of  the  bargainers  are  usually  assumed  to  be  of  the  von  Neumann- 
Morgenstern  type,  which  implies  that  they  are  representations  of  preferences  unique 
only  up  to  positive  affine  transformations  (details  are  omitted  here).  Independence 
of  Irrelevant  Alternatives  requires  the  solution  outcome  not  to  change  when  the  set 
of  possible  outcomes  shrinks,  the  original  solution  outcome  still  remaining  feasible. 

The Nash (bargaining) solution $N : \mathcal{B} \to \mathbb{R}^2$ is defined as follows. For every $(S, d) \in \mathcal{B}$,

\[
N(S, d) = \operatorname{argmax}\{(x_1 - d_1)(x_2 - d_2) \mid x \in S,\ x \ge d\}.
\]

That  the  Nash  bargaining  solution  is  well  defined,  follows  from  Problem  21.3. 
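As a numerical illustration of this definition, the Python sketch below approximates the Nash solution outcome for a hypothetical problem that does not appear in the text: $S = \{x \in \mathbb{R}^2 \mid 0 \le x_1 \le 1,\ 0 \le x_2 \le 2(1 - x_1^2)\}$ with $d = (0, 0)$. Since the Nash product increases in $x_2$, it suffices to search along the upper boundary of $S$; a plain grid search is used here for transparency, not efficiency.

```python
def frontier(x1):
    """Upper boundary of the hypothetical set S: x2 = 2 * (1 - x1**2)."""
    return 2.0 * (1.0 - x1 ** 2)

def nash_solution(g, d=(0.0, 0.0), x1_max=1.0, steps=100_000):
    """Maximize (x1 - d1) * (x2 - d2) over boundary points x >= d (grid search)."""
    best, best_product = None, -1.0
    for t in range(steps + 1):
        x1 = d[0] + (x1_max - d[0]) * t / steps
        x2 = g(x1)
        if x2 < d[1]:
            continue                    # point is not individually rational
        product = (x1 - d[0]) * (x2 - d[1])
        if product > best_product:
            best, best_product = (x1, x2), product
    return best

x = nash_solution(frontier)
print(x)   # close to (1 / sqrt(3), 4 / 3)
```

For this set the maximizer can also be found by hand: the derivative of $2x_1(1 - x_1^2)$ vanishes at $x_1 = 1/\sqrt{3}$, which the grid search reproduces up to the grid width.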

The  following  theorem  shows  that  the  four  conditions  above  characterize  the 
Nash  bargaining  solution. 

Theorem 21.1 Let $F : \mathcal{B} \to \mathbb{R}^2$ be a bargaining solution. Then the following two statements are equivalent:

(a) $F = N$.

(b) $F$ satisfies WPO, SYM, SC, IIA.

Proof The implication (a)$\Rightarrow$(b) is the subject of Problem 21.4. For the implication (b)$\Rightarrow$(a), assume $F$ satisfies WPO, SYM, SC, and IIA. Let $(S, d) \in \mathcal{B}$, and $z := N(S, d)$. Note that $z > d$. Let $T := \{((z_1 - d_1)^{-1}, (z_2 - d_2)^{-1})(x - d) \mid x \in S\}$. By SC,

\[
F(T, 0) = \left( \frac{F_1(S, d)}{z_1 - d_1}, \frac{F_2(S, d)}{z_2 - d_2} \right) - \left( \frac{d_1}{z_1 - d_1}, \frac{d_2}{z_2 - d_2} \right)
\tag{21.1}
\]

and

\[
N(T, 0) = \left( \frac{z_1 - d_1}{z_1 - d_1}, \frac{z_2 - d_2}{z_2 - d_2} \right) = (1, 1).
\tag{21.2}
\]

3 $\mathbb{R}^2_{++} = \{x = (x_1, x_2) \in \mathbb{R}^2 \mid x_1, x_2 > 0\}$.

[Fig. 21.1  Proof of Theorem 21.1]


Hence, in order to prove $F(S, d) = N(S, d)$, it is, in view of (21.1) and (21.2), sufficient to show that $F(T, 0) = (1, 1)$. By (21.2) and Problem 21.5, there is a supporting line of $T$ at $(1, 1)$ with slope $-1$. So the equation of this supporting line is $x_1 + x_2 = 2$. Choose $\alpha > 0$ so large that $T \subseteq D := \operatorname{conv}\{(-\alpha, -\alpha), (-\alpha, 2 + \alpha), (2 + \alpha, -\alpha)\}$. Cf. Fig. 21.1.

Then $(D, 0) \in \mathcal{B}$, $(D, 0)$ is symmetric, and $W(D) = \operatorname{conv}\{(-\alpha, 2 + \alpha), (2 + \alpha, -\alpha)\}$. Hence by SYM and WPO of $F$:

\[
F(D, 0) = (1, 1).
\tag{21.3}
\]

Since $T \subseteq D$ and $(1, 1) \in T$, we have by IIA and (21.3): $F(T, 0) = (1, 1)$. This completes the proof. ■


21.2  The Raiffa-Kalai-Smorodinsky Solution

Kalai and Smorodinsky (1975) replaced Nash's IIA (the most controversial axiom in Theorem 21.1) by the following condition. For a problem $(S, d) \in \mathcal{B}$,

\[
u(S, d) := (\max\{x_1 \mid x \in S,\ x \ge d\},\ \max\{x_2 \mid x \in S,\ x \ge d\})
\]

is called the utopia point of $(S, d)$.

Individual Monotonicity (IM): $F_j(S, d) \le F_j(T, e)$ for all $(S, d), (T, e) \in \mathcal{B}$ and $i, j \in \{1, 2\}$ with $i \ne j$, $d = e$, $S \subseteq T$, and $u_i(S, d) = u_i(T, e)$.

The Raiffa-Kalai-Smorodinsky solution $R : \mathcal{B} \to \mathbb{R}^2$ is defined as follows. For every $(S, d) \in \mathcal{B}$, $R(S, d)$ is the point of intersection of $W(S)$ with the straight line joining $d$ and $u(S, d)$.
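The geometric definition translates directly into a bisection along the segment from $d$ to $u(S, d)$. The Python sketch below is illustrative only and uses a hypothetical problem not taken from the text: $S = \{x \mid 0 \le x_1 \le 1,\ 0 \le x_2 \le 2(1 - x_1^2)\}$ with $d = (0, 0)$, for which the utopia point $(1, 2)$ is read off by hand. Convexity of $S$ guarantees that the feasible part of the segment is an interval starting at $d$.

```python
def in_S(x):
    """Membership test for the hypothetical set S."""
    x1, x2 = x
    return 0.0 <= x1 <= 1.0 and 0.0 <= x2 <= 2.0 * (1.0 - x1 ** 2)

def rks_solution(member, d, u, iterations=60):
    """Last point of S on the segment from d to the utopia point u (bisection)."""
    def point(t):
        return (d[0] + t * (u[0] - d[0]), d[1] + t * (u[1] - d[1]))
    lo, hi = 0.0, 1.0                   # d itself (t = 0) belongs to S
    for _ in range(iterations):
        mid = (lo + hi) / 2
        if member(point(mid)):
            lo = mid
        else:
            hi = mid
    return point(lo)

d = (0.0, 0.0)
u = (1.0, 2.0)                          # utopia point of (S, d), computed by hand
R = rks_solution(in_S, d, u)
print(R)   # close to (0.618, 1.236)
```

On the segment, $x_2 = 2x_1$, so the boundary is reached where $x_1 = 1 - x_1^2$, i.e. at $x_1 = (\sqrt{5} - 1)/2$.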

The  following  theorem  presents  a  characterization  of  the  Raiffa-Kalai- 
Smorodinsky  solution.  In  order  to  understand  the  proof  it  is  recommended  to 
draw  pictures,  just  as  in  the  proof  of  the  characterization  of  the  Nash  bargaining 
solution. 

Theorem 21.2 Let $F : \mathcal{B} \to \mathbb{R}^2$ be a bargaining solution. Then the following two statements are equivalent:

(a) $F = R$.

(b) $F$ satisfies WPO, SYM, SC, and IM.

Proof The implication (a)$\Rightarrow$(b) is the subject of Problem 21.7. For the converse implication, assume $F$ has the four properties stated. Let $(S, d) \in \mathcal{B}$ and let $T := \{ax + b \mid x \in S\}$ with $a := ((u_1(S, d) - d_1)^{-1}, (u_2(S, d) - d_2)^{-1})$, $b := -ad$. By SC of $R$ and $F$, $R(T, 0) = aR(S, d) + b$ and $F(T, 0) = aF(S, d) + b$. Hence, for $F(S, d) = R(S, d)$, it is sufficient to prove that $R(T, 0) = F(T, 0)$.

Since $u(T, 0) = (1, 1)$, $R(T, 0)$ is the point of $W(T)$ with equal coordinates, so $R_1(T, 0) = R_2(T, 0)$. If $R(T, 0) = (1, 1) = u(T, 0)$, then let $L := \operatorname{conv}\{(0, 0), (1, 1)\}$. Then by WPO, $F(L, 0) = (1, 1)$, so by IM, $F(T, 0) \ge F(L, 0)$, hence $F(T, 0) = F(L, 0) = R(T, 0)$.

Next assume $R(T, 0) < (1, 1)$. Let $\tilde{T} := \{x \in \mathbb{R}^2 \mid y \le x \le z \text{ for some } y, z \in T\}$. Clearly $T \subseteq \tilde{T}$ and $u(\tilde{T}, 0) = u(T, 0) = (1, 1)$ so by IM:

\[
F(\tilde{T}, 0) \ge F(T, 0),
\tag{21.4}
\]

and further, since $R(T, 0) \in W(\tilde{T})$ and $R_1(T, 0) = R_2(T, 0)$,

\[
R(\tilde{T}, 0) = R(T, 0).
\tag{21.5}
\]

Let $V := \operatorname{conv}\{0, R(T, 0), (1, 0), (0, 1)\}$. By WPO and SYM, $F(V, 0) = R(T, 0)$. By $V \subseteq \tilde{T}$, $u(V, 0) = u(\tilde{T}, 0) = (1, 1)$, and IM, we have $F(\tilde{T}, 0) \ge F(V, 0) = R(T, 0)$, hence $F(\tilde{T}, 0) = R(T, 0)$. Combined with (21.4), this implies $R(T, 0) \ge F(T, 0)$, hence $R(T, 0) = F(T, 0)$ by WPO and the fact $R(T, 0) < (1, 1)$. This completes the proof. ■
21.3  The Egalitarian Solution

Consider the following two properties for a bargaining solution $F$.

Pareto Optimality (PO): $F(S, d) \in P(S)$ for all $(S, d) \in \mathcal{B}$, where $P(S) := \{x \in S \mid \forall y \in S : y \ge x \Rightarrow y = x\}$ is the Pareto optimal subset of $S$.

Monotonicity (MON): $F(S, d) \le F(T, e)$ for all $(S, d), (T, e) \in \mathcal{B}$ with $S \subseteq T$ and $d = e$.

Clearly, $P(S) \subseteq W(S)$ for every $(S, d) \in \mathcal{B}$, and Pareto optimality is a
stronger  requirement  than  Weak  Pareto  Optimality.  The  Nash  and  Raiffa-Kalai- 
Smorodinsky  solutions  are  Pareto  optimal,  and  therefore  WPO  can  be  replaced  by 
PO  in  Theorems  21.1  and  21.2.  Monotonicity  is  much  stronger  than  Individual 
Monotonicity  or  Restricted  Monotonicity  (see  Problem  21.8  for  the  definition  of 
the  last  axiom)  and  in  fact  it  is  inconsistent  with  Weak  Pareto  Optimality.  (See 
Problem  21.10.) 

Call a problem $(S, d) \in \mathcal{B}$ comprehensive if $z \le y \le x$ implies $y \in S$ for all $z, x \in S$, $y \in \mathbb{R}^2$. By $\mathcal{B}_c$ we denote the subclass of comprehensive problems.

The egalitarian solution $E : \mathcal{B}_c \to \mathbb{R}^2$ assigns to each problem $(S, d) \in \mathcal{B}_c$ the point $E(S, d) \in W(S)$ with $E_1(S, d) - d_1 = E_2(S, d) - d_2$.
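A minimal Python sketch of this definition, for a hypothetical comprehensive problem that is not taken from the text ($S = \{x \mid 0 \le x_1 \le 1,\ 0 \le x_2 \le 2(1 - x_1^2)\}$, $d = (0, 0)$): move from $d$ in the direction $(1, 1)$ and find, by bisection, the last point that still belongs to $S$. The function names and the bound `t_max` are assumptions of the sketch.

```python
def in_S(x):
    """Membership test for the hypothetical comprehensive set S."""
    x1, x2 = x
    return 0.0 <= x1 <= 1.0 and 0.0 <= x2 <= 2.0 * (1.0 - x1 ** 2)

def egalitarian_solution(member, d, t_max=10.0, iterations=80):
    """Largest t with d + (t, t) in S, found by bisection.

    t_max must be chosen so large that d + (t_max, t_max) lies outside S."""
    lo, hi = 0.0, t_max
    for _ in range(iterations):
        mid = (lo + hi) / 2
        if member((d[0] + mid, d[1] + mid)):
            lo = mid
        else:
            hi = mid
    return (d[0] + lo, d[1] + lo)

E = egalitarian_solution(in_S, d=(0.0, 0.0))
print(E)   # close to (0.7808, 0.7808)
```

Both bargainers gain the same amount relative to $d$; here the common gain solves $t = 2(1 - t^2)$, i.e. $t = (\sqrt{17} - 1)/4$.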
The  following  axiom  is  a  weakening  of  Scale  Covariance. 

Translation Covariance (TC): $F(S + e, d + e) = F(S, d) + e$ for all $(S, d) \in \mathcal{B}_c$ and all $e \in \mathbb{R}^2$.

The  following  theorem  gives  a  characterization  of  the  egalitarian  solution  based  on 
Monotonicity. 

Theorem 21.3 Let $F : \mathcal{B}_c \to \mathbb{R}^2$ be a bargaining solution. Then the following two statements are equivalent:

(a) $F = E$.

(b) $F$ satisfies WPO, MON, SYM, and TC.

Proof The implication (a)$\Rightarrow$(b) is the subject of Problem 21.11. For the converse implication, let $(S, d) \in \mathcal{B}_c$. We want to show $F(S, d) = E(S, d)$.

In view of TC of $F$ and $E$, we may assume $d = 0$. Let $V := \{x \in \mathbb{R}^2 \mid 0 \le x \le E(S, 0)\}$. Clearly, $(V, 0) \in \mathcal{B}_c$ is a symmetric problem, so $F(V, 0) = E(S, 0)$ by SYM and WPO of $F$. By MON,

\[
F(S, 0) \ge F(V, 0) = E(S, 0).
\tag{21.6}
\]

If $E(S, 0) \in P(S)$, then (21.6) implies $F(S, 0) = E(S, 0)$, so we are done. Now suppose $E(S, 0) \in W(S) \setminus P(S)$. Without loss of generality, assume $E_1(S, 0) = u_1(S, 0)$, i.e., $E_1(S, 0) = \max\{x_1 \mid x \in S,\ x \ge 0\}$. Hence, $E_1(S, 0) = F_1(S, 0)$ by (21.6).

Suppose $F_2(S, 0) > E_2(S, 0)$. The proof will be finished by contradiction. Let $\alpha > 0$ with $E_2(S, 0) < \alpha < F_2(S, 0)$. Let $T := \operatorname{conv}(S \cup \{(\alpha, 0), (\alpha, \alpha)\})$. Then $(T, 0) \in \mathcal{B}_c$ and $E(T, 0) = (\alpha, \alpha) \in P(T)$, so $F(T, 0) = (\alpha, \alpha)$ by our earlier argument [see the line below (21.6)]. On the other hand, by MON, $F_2(T, 0) \ge F_2(S, 0) > \alpha$, a contradiction. ■

4 The notation $y \ge x$ means $y_i \ge x_i$ for $i = 1, 2$.

An alternative characterization of the egalitarian solution can be obtained by considering the following axioms. For a bargaining problem $(S, d)$, denote $S_d := \{x \in S \mid x \ge d\}$.

Super-Additivity (SA): $F(S + T, d + e) \ge F(S, d) + F(T, e)$ for all $(S, d), (T, e) \in \mathcal{B}_c$. Here, $S + T := \{x + y \mid x \in S,\ y \in T\}$.

Independence of Non-Individually Rational Alternatives (INIR): $F(S, d) = F(S_d, d)$ for all $(S, d) \in \mathcal{B}_c$.

INIR is a stronger version of the following well-known axiom.

Individual Rationality (IR): $F(S, d) \ge d$ for all $(S, d) \in \mathcal{B}_c$.

Observe, indeed, that INIR implies IR.

Theorem 21.4 Let $F : \mathcal{B}_c \to \mathbb{R}^2$ be a bargaining solution. Then the following two statements are equivalent:

(a) $F = E$.

(b) $F$ satisfies WPO, SA, SYM, INIR, and TC.

Proof (a)$\Rightarrow$(b) follows from Theorem 21.3 and Problem 21.12. For the converse implication, let $(S, d) \in \mathcal{B}_c$. We wish to show $F(S, d) = E(S, d)$. In view of TC of $F$ and $E$ we may assume $d = 0$. For every $1 > \varepsilon > 0$ let $V^\varepsilon := \{x \in \mathbb{R}^2 \mid 0 \le x \le (1 - \varepsilon)E(S, 0)\}$. Then $(V^\varepsilon, 0) \in \mathcal{B}_c$ and $F(V^\varepsilon, 0) = E(V^\varepsilon, 0) = (1 - \varepsilon)E(S, 0)$ by WPO and SYM of $F$ and $E$. Define $W^\varepsilon = \{x - (1 - \varepsilon)E(S, 0) \mid x \in S\}$ and $S^\varepsilon = W^\varepsilon + V^\varepsilon$ for every $1 > \varepsilon > 0$. Then $S^\varepsilon_0 = S_0$ and by SA we have

\[
F(S^\varepsilon, 0) \ge (1 - \varepsilon)E(S, 0) + F(W^\varepsilon, 0),
\]

hence by INIR

\[
F(S, 0) \ge (1 - \varepsilon)E(S, 0) + F(W^\varepsilon, 0)
\tag{21.7}
\]

for all $1 > \varepsilon > 0$. Letting $\varepsilon$ decrease to 0, we obtain by (21.7) and IR (which is implied by INIR):

\[
F(S, 0) \ge E(S, 0).
\tag{21.8}
\]

If $E(S, 0) \in P(S)$, then (21.8) implies $F(S, 0) = E(S, 0)$ and we are done. Otherwise, suppose without loss of generality that $E_1(S, 0) = \max\{x_1 \mid x \in S,\ x \ge 0\}$. Let $z$ be the point of $P(S)$ with $z_1 = E_1(S, 0)$, hence $\alpha := E_2(S, 0) - z_2 < 0$ since, by assumption, $E(S, 0) \notin P(S)$. For $\varepsilon > 0$, let $R^\varepsilon := \operatorname{conv}\{(0, \varepsilon), (0, \alpha), (\varepsilon, \alpha)\}$. Then $(R^\varepsilon, 0) \in \mathcal{B}_c$. Further, let $T^\varepsilon := S + R^\varepsilon$. By construction, $E(T^\varepsilon, 0) \in P(T^\varepsilon)$, hence, as before, $F(T^\varepsilon, 0) = E(T^\varepsilon, 0)$. If $\varepsilon$ approaches 0, $F(T^\varepsilon, 0)$ converges to $E(S, 0)$ and by SA and IR, $F(T^\varepsilon, 0) \ge F(S, 0)$. So $E(S, 0) \ge F(S, 0)$. Combined with (21.8), this implies $F(S, 0) = E(S, 0)$. ■


21.4  Noncooperative Bargaining

A different approach to bargaining is obtained by studying it as a strategic process. In this section we discuss the basics of the model of Rubinstein (1982) in an informal manner. See also Sect. 6.7 for a more elaborate treatment.

Point of departure is a bargaining problem $(S, d) \in \mathcal{B}$. Assume $d = 0$ and write $S$ instead of $(S, d)$. Suppose bargaining takes place over time, at moments $t = 0, 1, 2, \ldots$ At even moments, player 1 makes some proposal $x = (x_1, x_2) \in P(S)$ and player 2 accepts or rejects it. At odd moments, player 2 makes some proposal $x = (x_1, x_2) \in P(S)$ and player 1 accepts or rejects it. The game ends as soon as a proposal is accepted. If a proposal $x = (x_1, x_2)$ is accepted at time $t$, then the players receive payoffs $(\delta^t x_1, \delta^t x_2)$. Here $0 < \delta < 1$ is a so-called discount factor; it reflects impatience of the players, for instance because of foregone interest payments ('shrinking cake'). If no proposal is ever accepted, then the game ends with the disagreement payoffs of $(0, 0)$.

Suppose  player  1  has  in  mind  to  make  some  proposal  y  =  (y\,y2)  G  P(S ),  and 
that  player  2  has  in  mind  to  make  some  proposal  z  =  (zuzi)  £  P(S)-  So  player  1 
offers  to  player  2  the  amount  y2.  Player  2  expects  to  get  zi  if  he  rejects  y,  but  he 
will  get  zi  one  round  later.  So  player  l’s  proposal  y  will  be  rejected  by  player  2  if 
y2  <  8z2 ;  on  the  other  hand,  there  is  no  need  to  offer  strictly  more  than  6zi.  This 
leads  to  the  equation 


yi  =  Sz2  ■ 


(21.9) 


By  reversing  in  this  argument  the  roles  of  players  1  and  2  one  obtains 


$$z_1 = \delta y_1. \tag{21.10}$$

These two equations define unique points $y$ and $z$ in $P(S)$. The result of the
Rubinstein bargaining approach is that player 1 starts by offering $y$, player 2 accepts,
and the game ends with the payoffs $y = (y_1, y_2)$.

This  description  is  informal.  Formally,  one  defines  a  dynamic  noncooperative 
game  and  looks  for  the  (in  this  case)  subgame  perfect  Nash  equilibria  of  this  game. 
It  can  be  shown  that  all  such  equilibria  result  in  the  payoffs  y  (or  in  z  if  player  2 
would  start  instead  of  player  1). 

The surprising fact is that, although at first sight the Rubinstein approach is
quite different from the axiomatic approach by Nash (Theorem 21.1), the resulting
outcomes turn out to be closely related. From Eqs. (21.9) and (21.10) one derives
easily that $y_1 y_2 = z_1 z_2$, i.e., the points $y$ and $z$ are on the same level curve of
the function $x = (x_1, x_2) \mapsto x_1 x_2$, which appears in the definition of the Nash
bargaining solution. Moreover, if the discount factor $\delta$ approaches 1, the points $y$
and $z$ converge to one another on the curve $P(S)$, and hence to the Nash bargaining
solution outcome. In words, as the players become more patient, the outcome of the
Rubinstein model converges to the Nash bargaining solution outcome.
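Equations (21.9) and (21.10) are easy to solve numerically. The following Python sketch is an illustration only: the function name `rubinstein_proposals`, the description of $P(S)$ by a continuous decreasing function `frontier` on $[0, 1]$ with `frontier(1) == 0`, and the linear example are assumptions made here, not part of the model above. The sketch also prints the products $y_1 y_2$ and $z_1 z_2$, which coincide because $y_1 y_2 = y_1 \delta z_2 = (\delta y_1) z_2 = z_1 z_2$.

```python
def rubinstein_proposals(frontier, delta, tol=1e-12):
    """Solve y2 = delta*z2 and z1 = delta*y1 with y, z on the Pareto frontier.

    `frontier` maps a payoff x1 in [0, 1] to the Pareto optimal x2; it is
    assumed continuous and decreasing with frontier(1) == 0 (an assumption
    of this sketch).
    """
    # Substituting z1 = delta*y1 leaves one equation in y1:
    #   frontier(y1) - delta * frontier(delta * y1) = 0.
    def gap(y1):
        return frontier(y1) - delta * frontier(delta * y1)

    lo, hi = 0.0, 1.0  # gap(lo) > 0 > gap(hi) under the stated assumptions
    while hi - lo > tol:
        mid = (lo + hi) / 2
        if gap(mid) > 0:
            lo = mid
        else:
            hi = mid
    y1 = (lo + hi) / 2
    z1 = delta * y1
    return (y1, frontier(y1)), (z1, frontier(z1))


def linear(x1):
    # Hypothetical example: dividing one unit, P(S) = {x >= 0 : x1 + x2 = 1}.
    return 1.0 - x1


for delta in (0.5, 0.9, 0.99):
    y, z = rubinstein_proposals(linear, delta)
    print(delta, y, z, y[0] * y[1], z[0] * z[1])
```

For this linear frontier the equations can also be solved by hand: $y = (1/(1+\delta), \delta/(1+\delta))$, which tends to $(1/2, 1/2)$, the Nash bargaining solution outcome of this symmetric problem, as $\delta$ approaches 1.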


21.5 Games with Nontransferable Utility

Both TU-games and bargaining problems are special cases of NTU-games, games
with nontransferable utility. In an NTU-game, the possibilities from cooperation for
each coalition are described by a set, rather than a single number. For a TU-game
$(N, v)$ those sets can be defined as

$$V(S) = \{x \in \mathbb{R}^S \mid x(S) \le v(S)\}$$

for every coalition $S$. For a two-person bargaining problem $(S, d)$ the set of feasible
payoffs is $S$ for the grand coalition $\{1, 2\}$ and $(-\infty, d_i]$ for each player $i$.

The  core  can  be  extended  to  NTU-games  (for  bargaining  problems  it  is  just  the 
part  of  the  Pareto  optimal  set  weakly  dominating  the  disagreement  outcome).  Also 
the  balancedness  concept  can  be  extended;  the  main  result  here  is  that  balanced 
games  have  a  nonempty  core,  but  the  converse  is  not  true.  For  further  remarks  and 
references  see  the  Notes  section. 

21.6  Problems 

21.1.  Anonymity  and  Symmetry 

Call a two-person bargaining solution anonymous if $F_1(S', d') = F_2(S, d)$ and
$F_2(S', d') = F_1(S, d)$ whenever $(S, d), (S', d') \in B$ with $S' = \{(x_2, x_1) \in \mathbb{R}^2 \mid
(x_1, x_2) \in S\}$ and $(d_1', d_2') = (d_2, d_1)$. Prove that Anonymity implies Symmetry but
not vice versa.

21.2. Revealed Preference

Let $B_0 = \{(S, d) \in B \mid d = (0, 0)\}$. For $(S, d) \in B_0$ write $S$ instead of $(S, 0)$. Let $\succeq$
be a binary relation on $\mathbb{R}^2$ and $F : B_0 \to \mathbb{R}^2$ a solution. Say that $\succeq$ represents $F$ if
for every $S \in B_0$:

$$\{F(S)\} = \{x \in S \mid x \succeq y \text{ for every } y \in S\},$$

i.e., if $F$ uniquely maximizes $\succeq$ on $S$. Prove: $F$ satisfies IIA if and only if $F$ can be
represented by a binary relation $\succeq$.


21.3.  The  Nash  Solution  Is  Well-Defined 

Show that $N$ is well defined, i.e., that the function $(x_1 - d_1)(x_2 - d_2)$ takes its
maximum on $\{x \in S \mid x \ge d\}$ at a unique point.

21.4. (a) $\Rightarrow$ (b) in Theorem 21.1

Show  that  N  satisfies  the  properties  WPO,  SYM,  SC,  and  IIA. 


21.5.  Geometric  Characterization  of  the  Nash  Bargaining  Solution 

Show that, for every $(S, d) \in B$, $N(S, d) = z > d$ if and only if there is a supporting
line of $S$ at $z$ with slope the negative of the slope of the straight line through $d$ and $z$.

21.6.  Strong  Individual  Rationality 

Call a solution $F$ strongly individually rational (SIR) if $F(S, d) > d$ for all $(S, d) \in B$.
The disagreement solution $D$ is defined by $D(S, d) := d$ for every $(S, d) \in B$.
Show that the following two statements for a solution $F$ are equivalent:


(a) $F = N$ or $F = D$.

(b) $F$ satisfies IR, SYM, SC, and IIA.


Derive from this that $N$ is the unique solution with the properties SIR, SYM, SC,
and IIA. (Hint: For the implication (b) $\Rightarrow$ (a), show that, for every $(S, d) \in B$, either
$F(S, d) = d$ or $F(S, d) \in W(S)$. Also show that, if $F(S, d) = d$ for some $(S, d) \in B$,
then $F(S, d) = d$ for all $(S, d) \in B$.)

21.7. (a) $\Rightarrow$ (b) in Theorem 21.2

Show  that  the  Raiffa-Kalai-Smorodinsky  solution  has  the  properties  WPO,  SYM, 
SC,  and  IM. 


21.8.  Restricted  Monotonicity 

Call a solution $F : B \to \mathbb{R}^2$ restrictedly monotonic (RM) if $F(S, d) \le F(T, e)$
whenever $(S, d), (T, e) \in B$, $d = e$, $S \subseteq T$, $u(S, d) = u(T, e)$.

(a)  Prove  that  IM  implies  RM. 

(b)  Show  that  RM  does  not  imply  IM. 



21.9.  Global  Individual  Monotonicity 

For a problem $(S, d) \in B$, $g(S) := (\max\{x_1 \mid x \in S\}, \max\{x_2 \mid x \in S\})$ is called
the global utopia point of $S$. Global Individual Monotonicity (GIM) is defined in
the same way as IM, with the condition “$u_i(S, d) = u_i(T, e)$” replaced by: $g_i(S) = g_i(T)$.
The solution $G : B \to \mathbb{R}^2$ assigns to each $(S, d) \in B$ the point of intersection
of $W(S)$ with the straight line joining $d$ and $g(S)$. Show that $G$ is the unique solution
with the properties WPO, SYM, SC, and GIM.

21.10.  Monotonicity  and  (Weak)  Pareto  Optimality 

(a)  Show  that  there  is  no  solution  satisfying  MON  and  WPO. 

(b) Show that, on the subclass $B_0$ introduced in Problem 21.2, there is no solution
satisfying MON and PO. Can you find a solution on this class with the properties
MON and WPO?

21.11.  The  Egalitarian  Solution  (1) 

(a) Show that $E$ satisfies MON, SYM, and WPO (on $B_c$).

(b) Show that $E$ is translation covariant on $B_c$.

21.12.  The  Egalitarian  Solution  (2) 

Show that the egalitarian solution is super-additive.

21.13.  Independence  of  Axioms 

In  the  characterization  Theorems  21.1-21.4,  show  that  none  of  the  axioms  used  can 
be  dispensed  with. 

21.14.  Nash  and  Rubinstein 

Suppose two players (bargainers) bargain over the division of one unit of a perfectly
divisible good. Player 1 has utility function $u_1(a) = a$ and player 2 has utility
function $u_2(a) = 1 - (1 - a)^2$ for amounts $a \in [0, 1]$ of the good. If they do not
reach an agreement on the division of the good they both receive nothing.

(a)  Determine  the  set  of  feasible  utility  pairs.  Make  a  picture. 

(b)  Determine  the  Nash  bargaining  solution  outcome,  in  terms  of  utilities  as  well  as 
of  the  physical  distribution  of  the  good. 

(c) Suppose the players’ utilities are discounted by a factor $\delta \in (0, 1)$. Calculate the
Rubinstein bargaining outcome.

(d) Determine the limit of the Rubinstein bargaining outcome, for $\delta$ approaching 1,
in two ways: by using the result of (b) and by using the result of (c).


21.7 Notes

The  axiomatic  study  of  bargaining  problems  was  initiated  by  Nash  (1950).  For 
comprehensive  surveys  see  Peters  (1992)  or  Thomson  (1994).  Theorem  21.1  is  due 
to  Nash  (1950). 

The  Raiffa-Kalai-Smorodinsky  solution  was  introduced  by  Raiffa  (1953)  and 
axiomatized  by  Kalai  and  Smorodinsky  (1975).  Theorem  21.2  is  a  modified  version 
of  this  characterization. 

For  the  analysis  of  the  noncooperative  bargaining  model  in  Sect.  21.4  see 
Rubinstein  (1982)  or  Sutton  (1986).  For  an  elaborate  discussion  of  noncooperative 
bargaining  models  see  Muthoo  (1999).  The  observation  about  the  relation  with  the 
Nash  bargaining  solution  is  due  to  Binmore  et  al.  (1986). 

The  fact  that  balanced  NTU-games  have  a  nonempty  core  is  due  to  Scarf  (1976). 
A  complete  characterization  of  NTU  games  with  nonempty  core  based  on  a  kind  of 
local  balancedness  condition  is  provided  by  Predtetchinski  and  Herings  (2004). 

Most  other  solution  concepts  for  NTU-games — in  particular  the  Harsanyi  (1963) 
and Shapley (1969) NTU-values, and the consistent value of Hart and Mas-Colell
(1996) — extend  the  Nash  bargaining  solution  as  well  as  the  Shapley  value  for 
TU-games.  An  exception  are  the  monotonic  solutions  of  Kalai  and  Samet  (1985), 
which  extend  the  egalitarian  solution  of  the  bargaining  problem.  See  de  Clippel 
et  al.  (2004)  for  an  overview  of  various  axiomatic  characterizations  of  values  for 
NTU-games,  and  see  Peters  (2003)  for  an  overview  of  NTU-games  in  general.  An 
extensive textbook treatment can be found in Peleg and Sudhölter (2003, Part II).

Most  (though  not  all)  results  of  this  chapter  on  bargaining  can  be  extended  to 
the  n-person  case  without  too  much  difficulty.  This  is  not  true  for  the  Rubinstein 
approach,  the  extension  of  which  is  not  obvious.  One  possibility  is  presented  by 
Hart and Mas-Colell (1996).


This chapter collects some mathematical tools used in this book: (direct) convex
separation  results  in  Sects.  22.2  and  22.6;  Lemmas  of  the  Alternative,  in  particular 
Farkas’  Lemma  in  Sect.  22.3;  the  Linear  Duality  Theorem  in  Sect.  22.4;  the  Brouwer 
and  Kakutani  Fixed  Point  Theorems  in  Sect.  22.5;  the  Krein-Milman  Theorem  and 
the  Birkhoff-von  Neumann  Theorem  in  Sect.  22.6;  and  the  Max  Flow  Min  Cut 
Theorem  of  Ford  and  Fulkerson  in  Sect.  22.7. 


22.1  Some  Definitions 

A subset $Z \subseteq \mathbb{R}^n$ is convex if with any two points $x, y \in Z$, also the line segment
connecting $x$ and $y$ is contained in $Z$. Formally:

$$\forall x, y \in Z\ \ \forall\, 0 \le \lambda \le 1 :\ \lambda x + (1 - \lambda) y \in Z.$$

If $Z$ is a closed set¹ then for convexity it is sufficient to check this condition for
$\lambda = 1/2$ (see Problem 22.1). It is easy to see that a set $Z \subseteq \mathbb{R}^n$ is convex if and
only if $\sum_{j=1}^k \lambda_j x^j \in Z$ for all $x^1, \ldots, x^k \in Z$ and all nonnegative $\lambda_1, \ldots, \lambda_k \in \mathbb{R}$ with
$\sum_{j=1}^k \lambda_j = 1$. Such a sum $\sum_{j=1}^k \lambda_j x^j$ is called a convex combination of the $x^j$. For an
arbitrary subset $D \subseteq \mathbb{R}^n$, the convex hull of $D$ is the set of all convex combinations
of elements of $D$ or, equivalently, the smallest (with respect to set inclusion) convex
subset of $\mathbb{R}^n$ containing $D$.

For vectors $x = (x_1, \ldots, x_n)$, $y = (y_1, \ldots, y_n) \in \mathbb{R}^n$,

$$x \cdot y := \sum_{i=1}^n x_i y_i$$

denotes the inner product of $x$ and $y$, and

$$\|x - y\| := \sqrt{\sum_{i=1}^n (x_i - y_i)^2}$$

is the Euclidean distance between $x$ and $y$. A set $C \subseteq \mathbb{R}^n$ is a (convex) cone if, with
each $x, y \in C$ and $\lambda \in \mathbb{R}$, $\lambda \ge 0$, also $\lambda x \in C$ and $x + y \in C$.

¹ A set $Z \subseteq \mathbb{R}^n$ is closed if it contains the limit of every converging sequence in $Z$.


22.2  A  Separation  Theorem 

In  this  section  we  derive  the  simplest  version  of  a  separation  result,  namely 
separating  a  point  from  a  convex  set. 

Theorem 22.1 Let $Z \subseteq \mathbb{R}^n$ be a closed convex set and let $x \in \mathbb{R}^n \setminus Z$. Then there is
a $y \in \mathbb{R}^n$ with $y \cdot z > y \cdot x$ for every $z \in Z$.


Thus,  this  theorem  states  the  geometrically  obvious  fact  that  a  closed  convex  set  and 
a  point  not  in  that  set  can  be  separated  by  a  hyperplane  (with  normal  y). 


Proof of Theorem 22.1 Let $z' \in Z$ such that $0 < \|x - z'\| \le \|x - z\|$ for all $z \in Z$.
Such a $z'$ exists by the Theorem of Weierstrass, since the Euclidean distance from
$x$ is a continuous function on the set $Z$, and for the minimum of $z \mapsto \|x - z\|$
on $Z$ attention can be restricted to a compact (i.e., bounded and closed) subset of
$Z$. Let $y = z' - x$. Let $z \in Z$. For any $\alpha$, $0 \le \alpha \le 1$, convexity of $Z$ implies
$z' + \alpha(z - z') \in Z$, and thus

$$\|z' + \alpha(z - z') - x\|^2 \ge \|z' - x\|^2.$$

Hence,

$$2\alpha (z' - x) \cdot (z - z') + \alpha^2 \|z - z'\|^2 \ge 0.$$

Thus, letting $\alpha \downarrow 0$, it follows that $(z' - x) \cdot (z - z') \ge 0$. From this,
$(z' - x) \cdot z \ge (z' - x) \cdot z' = (z' - x) \cdot x + (z' - x) \cdot (z' - x) > (z' - x) \cdot x$.

Because $z$ was arbitrary, it follows that $y \cdot z > y \cdot x$ for every $z \in Z$. ■
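The construction in the proof can be traced on a small example. The sketch below is an illustration under assumptions made here: $Z$ is taken to be the unit square (a hypothetical choice for which the nearest point $z'$ is obtained by clipping coordinates), and the names `project_onto_box`, `z_near`, and `corners` do not come from the text. It computes $y = z' - x$ and compares the minimum of $y \cdot z$ over $Z$ with $y \cdot x$.

```python
from itertools import product


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def project_onto_box(x, lower, upper):
    # Nearest point of the box [lower, upper] to x: clip each coordinate.
    return [min(max(xi, lo), hi) for xi, lo, hi in zip(x, lower, upper)]


lower, upper = [0.0, 0.0], [1.0, 1.0]  # hypothetical Z = [0, 1] x [0, 1]
x = [2.0, 1.5]                          # a point outside Z

z_near = project_onto_box(x, lower, upper)   # plays the role of z'
y = [zi - xi for zi, xi in zip(z_near, x)]   # normal y = z' - x

# A linear function attains its minimum over a box at a vertex, so it
# suffices to evaluate y . z at the four corners of Z.
corners = [list(c) for c in product(*zip(lower, upper))]
min_over_Z = min(dot(y, z) for z in corners)

print("z' =", z_near, " y =", y)
print("min over Z of y.z =", min_over_Z, " y.x =", dot(y, x))
```

In this example the minimum of $y \cdot z$ over $Z$ is attained at $z'$ itself, in line with the inequality $(z' - x) \cdot z \ge (z' - x) \cdot z'$ from the proof.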


Remark 22.2 A consequence of Theorem 22.1 is that there are real numbers $\alpha$ and
$\beta$ satisfying $y \cdot z > \alpha$ and $y \cdot x < \alpha$, and $y \cdot z > \beta$ and $y \cdot x = \beta$, for all $z \in Z$
(notations as in the theorem). The last assertion is trivial. For the first assertion, note
that in the proof of the theorem we have $y \cdot z \ge y \cdot z'$ for all $z \in Z$, so $y \cdot z'$ is a lower
bound for $y \cdot z$. Then take, for instance, $\alpha = \frac{1}{2}(y \cdot z' + y \cdot x)$.


22.3  Lemmas  of  the  Alternative 


Theorem 22.1 can be used to derive several lemmas of the alternative. These lemmas
have  in  common  that  they  describe  two  systems  of  linear  inequalities  and  equations, 
exactly  one  of  which  has  a  solution. 

Lemma 22.3 (Theorem of the Alternative for Matrices) Let $A$ be an $m \times n$
matrix. Exactly one of the following two statements is true.

(a) There are $y \in \mathbb{R}^n$ and $z \in \mathbb{R}^m$ with $(y, z) \ge 0$, $(y, z) \ne 0$ and $Ay + z = 0$.

(b) There is an $x \in \mathbb{R}^m$ with $x > 0$ and $xA > 0$.

Proof  We  leave  it  to  the  reader  to  prove  that  at  most  one  of  the  systems  in  (a)  and 
(b)  has  a  solution  (Problem  22.2). 

Now suppose that (a) is not true. It is sufficient to prove that the system in (b)
must have a solution. Observe that (a) implies that 0 is a convex combination of the
columns of $A$ and the set $\{e^j \in \mathbb{R}^m \mid j = 1, \ldots, m\}$. This follows from dividing both
sides of the equation $Ay + z = 0$ by $\sum_{j=1}^n y_j + \sum_{i=1}^m z_i$. Hence, the assumption that
(a) is not true means that $0 \notin Z$, where $Z \subseteq \mathbb{R}^m$ is the convex hull of the columns of
$A$ and the set $\{e^j \in \mathbb{R}^m \mid j = 1, \ldots, m\}$. By Theorem 22.1 and Remark 22.2 there
is an $x \in \mathbb{R}^m$ and a number $\beta \in \mathbb{R}$ such that $x \cdot p > \beta$ for all $p \in Z$ and $x \cdot 0 = \beta$.
Hence, $\beta = 0$ and, in particular, $xA > 0$ and $x > 0$ since the columns of $A$ and all
$e^j$ for $j = 1, \ldots, m$ are elements of $Z$. Thus, (b) is true. ■

Another  lemma  of  the  alternative  is  Farkas’  Lemma  below.  In  its  proof  we  use  the 
following  result. 

Lemma 22.4 Let $A$ be an $m \times n$ matrix and let

$$Z = \{z \in \mathbb{R}^n \mid \text{there exists an } x \in \mathbb{R}^m,\ x \ge 0 \text{ with } z = xA\}.$$

Then $Z$ is closed.

Proof 

(a) Suppose that the rank of the matrix $A$, $r(A)$, is equal to $m$. Then also $r(AA^T) = m$,
where $A^T$ is the transpose of $A$ (Problem 22.3). Therefore, $AA^T$ is invertible.
Let $(z^n)_{n \in \mathbb{N}}$ be a sequence in $Z$ converging to $z \in \mathbb{R}^n$. Let $z^n = x^n A$, $x^n \ge 0$,
for all $n \in \mathbb{N}$. Then $x^n = x^n AA^T (AA^T)^{-1}$ for every $n$, hence $x^n A \to z$ implies
$x^n \to z A^T (AA^T)^{-1} =: x$, and in particular $x \ge 0$. Thus, since $x^n A \to xA$, we
obtain $z = xA$, so that $z \in Z$.

(b) Let $b \in Z \setminus \{0\}$ and choose $x \in \mathbb{R}^m$, $x \ge 0$, with $b = xA$ such that $|S|$ is minimal,
where $S := \{i \in \{1, \ldots, m\} \mid x_i > 0\}$. We show that the rows of $A$ with numbers
in $S$ are linearly independent. If not, then there is a $\mu \in \mathbb{R}^m$ with $\mu A = 0$,
$\mu_j \ne 0$ for some $j \in S$, and $\mu_i = 0$ for all $i \notin S$. Then $(x - t\mu)A = xA = b$
for every $t \in \mathbb{R}$. Choose $t$ in such a way that $x_j - t\mu_j \ge 0$ for all $j \in S$ and
$x_j - t\mu_j = 0$ for some $j \in S$. Then $b = (x - t\mu)A$, $x - t\mu \ge 0$, and
$|\{i \in \{1, \ldots, m\} \mid x_i - t\mu_i > 0\}| \le |S| - 1$, a contradiction.

(c) In view of part (b) of the proof we can write

$$Z = \bigcup_{B} \{xB \mid 0 \le x \in \mathbb{R}^k\},$$

where the union is taken over all $B$ such that $B$ is a $k \times n$ submatrix of $A$ with $r(B) = k \le r(A)$.

By part (a), each of the sets at the right hand side of this equation is closed.
Hence, $Z$ is the union of finitely many closed sets, and therefore is closed
itself. ■


Lemma 22.5 (Farkas’ Lemma) Let $A$ be an $m \times n$ matrix and $b \in \mathbb{R}^n$. Exactly one
of the following two statements is true.

(a) There is an $x \in \mathbb{R}^m$ with $x \ge 0$ and $xA = b$.

(b) There is a $y \in \mathbb{R}^n$ with $Ay \ge 0$ and $b \cdot y < 0$.

Proof  We  leave  it  to  the  reader  to  show  that  at  most  one  of  the  two  systems  in  (a) 
and  (b)  can  have  a  solution  (Problem  22.4).  Assume  that  the  system  in  (a)  does  not 
have  a  solution.  It  is  sufficient  to  prove  that  the  system  in  (b)  must  have  a  solution. 

The assumption that the system in (a) does not have a solution is equivalent to
the statement $b \notin Z$ where

$$Z = \{z \in \mathbb{R}^n \mid \text{there exists an } x \in \mathbb{R}^m,\ x \ge 0 \text{ with } z = xA\}.$$

Clearly, the set $Z$ is convex, and it is closed by Lemma 22.4. By Theorem 22.1 and
Remark 22.2 it follows that there is a $y \in \mathbb{R}^n$ and an $\alpha \in \mathbb{R}$ with $y \cdot b < \alpha$ and
$y \cdot z > \alpha$ for all $z \in Z$. Because $0 \in Z$ it follows that $\alpha < y \cdot 0 = 0$, hence
$y \cdot b < \alpha < 0$. To prove that the system in (b) has a solution, it is sufficient to prove
that $Ay \ge 0$. Suppose not, i.e., there is an $i$ with $(Ay)_i < 0$. Then $e^i A y < 0$, so
$(M e^i) A y \to -\infty$ as $\mathbb{R} \ni M \to \infty$. Observe, however, that $(M e^i) A \in Z$ for every
$M > 0$, so that $(M e^i) A y > \alpha$ for every such $M$. This contradiction completes the
proof of the lemma. ■
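A small numerical check may help to fix the two alternatives of Farkas' Lemma. The Python sketch below only verifies given certificates; it does not search for them. The matrix `A`, the vectors `b_in` and `b_out`, and the helper names are hypothetical choices made for this illustration.

```python
def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def row_times_matrix(x, A):
    # x A for a row vector x (length m) and an m x n matrix A.
    return [sum(x[i] * A[i][j] for i in range(len(A))) for j in range(len(A[0]))]


def matrix_times_col(A, y):
    # A y for an m x n matrix A and a column vector y (length n).
    return [dot(row, y) for row in A]


def certifies_a(A, b, x, tol=1e-9):
    """True if x >= 0 and xA = b, i.e. x solves system (a)."""
    return (all(xi >= -tol for xi in x)
            and all(abs(p - q) <= tol for p, q in zip(row_times_matrix(x, A), b)))


def certifies_b(A, b, y, tol=1e-9):
    """True if Ay >= 0 and b.y < 0, i.e. y solves system (b)."""
    return all(v >= -tol for v in matrix_times_col(A, y)) and dot(b, y) < -tol


# Rows r_1 = (1, 0) and r_2 = (1, 1) generate the cone {b : 0 <= b_2 <= b_1}.
A = [[1.0, 0.0],
     [1.0, 1.0]]

b_in = [3.0, 1.0]    # lies in the cone: b_in = 2 r_1 + 1 r_2
b_out = [0.0, 1.0]   # lies outside the cone

print(certifies_a(A, b_in, [2.0, 1.0]))     # system (a) is solvable for b_in
print(certifies_b(A, b_out, [1.0, -1.0]))   # system (b) is solvable for b_out
print(certifies_b(A, b_in, [1.0, -1.0]))    # the same y does not work for b_in
```

The vector $y = (1, -1)$ used for `b_out` is a normal of a hyperplane with the cone on one side and `b_out` strictly on the other.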

These lemmas can be interpreted geometrically. We show this for Farkas’ Lemma in
Fig. 22.1. Consider the row vectors $r_i$ of $A$ as points in $\mathbb{R}^n$. The set of all nonnegative
linear combinations of the $r_i$ forms a cone $C$. The statement that the system in (a) in
Lemma 22.5 has no nonnegative solution means that the vector $b$ does not lie in $C$.
In this case, the lemma asserts the existence of a vector $y$ which makes an obtuse
angle with $b$ and a nonobtuse angle with each of the vectors $r_i$. This means that the
hyperplane $L$ orthogonal to $y$ has the cone $C$ on one side and the point $b$ on the
other.


Fig. 22.1 Geometric interpretation of Farkas’ Lemma


22.4 The Duality Theorem of Linear Programming

In  this  section  we  prove  the  following  theorem. 


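Theorem 22.6 (Duality Theorem of Linear Programming) Let $A$ be an $n \times p$
matrix, $b \in \mathbb{R}^p$, and $c \in \mathbb{R}^n$. Suppose $V := \{x \in \mathbb{R}^n \mid xA \ge b,\ x \ge 0\} \ne \emptyset$ and
$W := \{y \in \mathbb{R}^p \mid Ay \le c,\ y \ge 0\} \ne \emptyset$. Then $\min\{x \cdot c \mid x \in V\} = \max\{b \cdot y \mid y \in W\}$.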

To  prove  this  theorem,  we  first  prove  the  following  variant  of  Farkas’  Lemma. 


Lemma 22.7 Let $A$ be an $m \times n$ matrix and $b \in \mathbb{R}^n$. Exactly one of the following
two statements is true.


(a) There is an $x \in \mathbb{R}^m$ with $xA \le b$ and $x \ge 0$.

(b) There is a $y \in \mathbb{R}^n$ with $Ay \ge 0$, $b \cdot y < 0$, and $y \ge 0$.

Proof  Problem  22.5.  ■ 

The  following  two  lemmas  are  further  preparations  for  the  proof  of  the  Duality 
Theorem. 


Lemma 22.8 Let $x \in V$ and $y \in W$ (cf. Theorem 22.6). Then $x \cdot c \ge b \cdot y$.

Proof $x \cdot c \ge xAy \ge b \cdot y$. ■


Lemma 22.9 Let $\hat{x} \in V$, $\hat{y} \in W$ with $\hat{x} \cdot c = b \cdot \hat{y}$. Then $\hat{x} \cdot c = \min\{x \cdot c \mid x \in V\}$
and $b \cdot \hat{y} = \max\{b \cdot y \mid y \in W\}$.


Proof By Lemma 22.8, for every $x \in V$: $x \cdot c \ge b \cdot \hat{y} = \hat{x} \cdot c$. Similarly, $b \cdot y \le
\hat{x} \cdot c = b \cdot \hat{y}$ for every $y \in W$. ■

Proof of Theorem 22.6 In view of Lemmas 22.8 and 22.9, it is sufficient to show
the existence of $x \in V$ and $y \in W$ with $x \cdot c \le b \cdot y$. So it is sufficient to show that
the system

$$(x, y) \begin{pmatrix} -A & 0 & c \\ 0 & A^T & -b \end{pmatrix} \le (-b, c, 0), \quad x \ge 0,\ y \ge 0$$

has a solution. Suppose this is not the case. By Lemma 22.7, there exists a vector
$(z, w, t) \in \mathbb{R}^p \times \mathbb{R}^n \times \mathbb{R}$ with

$$\begin{pmatrix} -A & 0 & c \\ 0 & A^T & -b \end{pmatrix} \begin{pmatrix} z \\ w \\ t \end{pmatrix} \ge 0, \quad (-b, c, 0) \cdot (z, w, t) < 0, \quad z \ge 0,\ w \ge 0,\ t \ge 0.$$

Hence

$$Az \le tc \tag{22.1}$$

$$wA \ge tb \tag{22.2}$$

$$c \cdot w < b \cdot z. \tag{22.3}$$

If $t = 0$, then $Az \le 0$ and $wA \ge 0$, hence, for $x \in V$ and $y \in W$:

$$b \cdot z \le xAz \le 0 \le wAy \le w \cdot c,$$

contradicting (22.3). If $t > 0$, then by (22.1) and (22.2), $t^{-1}z \in W$ and $t^{-1}w \in V$.
By (22.3), $b \cdot (t^{-1}z) > (t^{-1}w) \cdot c$, which contradicts Lemma 22.8. Hence, the first
system above must have a solution. ■
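Lemmas 22.8 and 22.9 can be checked on a small instance. In the Python sketch below, the data `A`, `b`, `c` and the pair `x_hat`, `y_hat` are hypothetical and chosen so that both constraints hold with equality; the grid search is only a crude illustration of weak duality, not a method for solving linear programs.

```python
def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


# Hypothetical data: A is n x p with n = p = 2, b in R^p, c in R^n.
A = [[1, 2],
     [3, 1]]
b = [4, 3]
c = [3, 4]


def x_times_A(x):
    return [sum(x[i] * A[i][j] for i in range(len(A))) for j in range(len(A[0]))]


def A_times_y(y):
    return [dot(row, y) for row in A]


def in_V(x):
    # V = {x : xA >= b, x >= 0}
    return all(xi >= 0 for xi in x) and all(l >= r for l, r in zip(x_times_A(x), b))


def in_W(y):
    # W = {y : Ay <= c, y >= 0}
    return all(yi >= 0 for yi in y) and all(l <= r for l, r in zip(A_times_y(y), c))


x_hat, y_hat = [1, 1], [1, 1]
# Lemma 22.9: a feasible pair with equal objective values is optimal.
print(in_V(x_hat), in_W(y_hat), dot(x_hat, c), dot(b, y_hat))

# Lemma 22.8 on a grid of feasible points: x.c - b.y is never negative.
grid = [i / 4 for i in range(17)]
V_pts = [(u, v) for u in grid for v in grid if in_V((u, v))]
W_pts = [(u, v) for u in grid for v in grid if in_W((u, v))]
smallest_gap = min(dot(x, c) - dot(b, y) for x in V_pts for y in W_pts)
print(smallest_gap)
```

Since `x_hat` and `y_hat` are feasible and give the same value 7, Lemma 22.9 implies that 7 is both the minimum over $V$ and the maximum over $W$ in this example.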


22.5  Some  Fixed  Point  Theorems 

Let $Z \subseteq \mathbb{R}^n$ be a nonempty convex and compact² set. Let $f : Z \to Z$ be a continuous
function. A point $x^* \in Z$ is a fixed point of $f$ if $f(x^*) = x^*$.

If $n = 1$, then $Z$ is a closed interval of the form $[a, b] \subseteq \mathbb{R}$, and then it is clear (by
drawing a picture) that $f$ must have a fixed point: formally, this is a straightforward
implication of the intermediate-value theorem.
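The one-dimensional argument can be turned into a computation. The following Python sketch applies bisection to $g(x) = f(x) - x$, which satisfies $g(a) \ge 0 \ge g(b)$ because $f$ maps $[a, b]$ into itself. The function name `fixed_point_1d` and the example $f = \cos$ on $[0, 1]$ are assumptions made for this illustration.

```python
import math


def fixed_point_1d(f, a, b, tol=1e-12):
    """Approximate a fixed point of a continuous f: [a, b] -> [a, b]."""
    # Invariant: f(lo) - lo >= 0 and f(hi) - hi <= 0, so by the
    # intermediate-value theorem a fixed point lies in [lo, hi].
    lo, hi = a, b
    while hi - lo > tol:
        mid = (lo + hi) / 2
        if f(mid) - mid >= 0:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2


# cos maps [0, 1] into [cos(1), 1], a subset of [0, 1].
x_star = fixed_point_1d(math.cos, 0.0, 1.0)
print(x_star, math.cos(x_star))
```

No comparably simple procedure is available in higher dimensions, which is one reason why the general result below is considerably deeper.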

More  generally,  the  following  result  holds. 


Theorem 22.10 (Brouwer Fixed Point Theorem) Let $Z \subseteq \mathbb{R}^n$ be a nonempty
compact and convex set and let $f : Z \to Z$ be a continuous function. Then $f$ has
a fixed point.

A generalization of Brouwer’s fixed point theorem is Kakutani’s fixed point
theorem. Let $F : Z \to Z$ be a correspondence, i.e., $F(x)$ is a nonempty subset
of $Z$ for every $x \in Z$. Call $F$ convex-valued if $F(x)$ is a convex set for every $x \in Z$.
Call $F$ upper semi-continuous if the following holds: for every sequence $(x^k)_{k \in \mathbb{N}}$ in
$Z$ converging to $x \in Z$ and for every sequence $(y^k)_{k \in \mathbb{N}}$ in $Z$ converging to $y \in Z$, if
$y^k \in F(x^k)$ for every $k \in \mathbb{N}$, then $y \in F(x)$. A point $x^* \in Z$ is a fixed point of $F$ if
$x^* \in F(x^*)$.

Theorem 22.11 (Kakutani Fixed Point Theorem) Let $Z \subseteq \mathbb{R}^n$ be a nonempty
compact and convex set and let $F : Z \to Z$ be an upper semi-continuous and
convex-valued correspondence. Then $F$ has a fixed point.

² A set $Z \subseteq \mathbb{R}^n$ is compact if it is closed and bounded. A set $Z \subseteq \mathbb{R}^n$ is bounded if there is an
$M > 0$ such that $\|x\| < M$ for all $x \in Z$.


22.6  The  Birkhoff-von  Neumann  Theorem 


Let $C$ be a convex set in a linear space $V$. An element $e \in C$ is called an extreme
point of $C$ if for all $x, y \in C$ with $e = \frac{1}{2}(x + y)$ it holds that $x = y$ $(= e)$. By
ext($C$) the set of extreme points of $C$ is denoted. See Problem 22.6 for alternative
characterizations of extreme points.

An $n \times n$-matrix $D$ is called doubly stochastic if $0 \le d_{ij} \le 1$ for all $i, j = 1, \ldots, n$,
$\sum_{j=1}^n d_{ij} = 1$ for all $i$, and $\sum_{i=1}^n d_{ij} = 1$ for all $j$. If moreover $d_{ij} \in \{0, 1\}$ for all
$i, j = 1, \ldots, n$, then $D$ is called a permutation matrix. Let $D^{n \times n}$ denote the set of all
$n \times n$ doubly stochastic matrices, and let $P^{n \times n}$ denote the set of all $n \times n$ permutation
matrices. Note that $D^{n \times n}$ is a convex compact set, and that $P^{n \times n}$ is a finite subset of
$D^{n \times n}$. The following theorem gives the exact relation.

Theorem  22.12  (Birkhoff-von  Neumann) 


(a) ext($D^{n \times n}$) $= P^{n \times n}$

(b) $D^{n \times n} =$ conv($P^{n \times n}$)
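Part (b) can be made concrete by writing a given doubly stochastic matrix as a convex combination of permutation matrices. The Python sketch below uses a simple greedy procedure that is not taken from the text: it repeatedly picks a permutation supported on the positive entries of what is left (found here by brute force, so only suitable for small $n$) and subtracts as much of it as possible. The names `birkhoff_decomposition`, `recombine`, and `D_example` are hypothetical.

```python
from fractions import Fraction
from itertools import permutations


def birkhoff_decomposition(D):
    """Return a list of (weight, permutation) pairs whose weighted sum is D."""
    n = len(D)
    R = [[Fraction(v) for v in row] for row in D]  # mass not yet accounted for
    terms = []
    while any(v != 0 for row in R for v in row):
        # R is a positive multiple of a doubly stochastic matrix, so by the
        # theorem some permutation lies entirely on positive entries of R.
        sigma = next(s for s in permutations(range(n))
                     if all(R[i][s[i]] > 0 for i in range(n)))
        w = min(R[i][sigma[i]] for i in range(n))
        for i in range(n):
            R[i][sigma[i]] -= w  # at least one entry of R becomes zero
        terms.append((w, sigma))
    return terms


def recombine(terms, n):
    # Weighted sum of the permutation matrices; sigma[i] is the column of the 1 in row i.
    M = [[Fraction(0)] * n for _ in range(n)]
    for w, sigma in terms:
        for i in range(n):
            M[i][sigma[i]] += w
    return M


half, quarter = Fraction(1, 2), Fraction(1, 4)
D_example = [[half, half, 0],
             [quarter, quarter, half],
             [quarter, quarter, half]]

terms = birkhoff_decomposition(D_example)
for w, sigma in terms:
    print(w, sigma)
print(recombine(terms, 3) == D_example)
```

Exact rational arithmetic (`Fraction`) is used so that the test for zero entries is not affected by rounding.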


Part (b) of Theorem 22.12 follows from the Theorem of Krein-Milman (Theorem 22.14
below). In the proof of the latter theorem the dimension of a subset of a
linear space $V$ plays a role. A subset of $V$ of the form $a + L$ where $a \in V$ and $L$ is
a linear subspace of $V$, is called an affine subspace. Check that a subset $A$ of $V$ is
affine if, and only if, with any two different elements $v$ and $y$ of $A$, also the straight
line through $v$ and $y$ is contained in $A$ (Problem 22.7). For an affine subspace $a + L$
of $V$ the dimension is defined to be the dimension of the linear subspace $L$. For an
arbitrary subset $A$ of $V$, its dimension dim($A$) is defined to be the dimension of the
smallest affine subspace of $V$ containing the set $A$.

The following separation lemma is used in the proof of the Theorem of Krein-Milman.
By int($C$) and clo($C$) we denote the (topological) interior and closure of
the set $C \subseteq \mathbb{R}^n$, respectively.

Lemma 22.13 Let $C$ be a nonempty convex subset of $\mathbb{R}^n$ and $a \in \mathbb{R}^n \setminus$ int($C$). Then
there exists a $p \in \mathbb{R}^n \setminus \{0\}$ with $p \cdot a \le p \cdot c$ for every $c \in C$.

Proof We distinguish two cases: (a) $a \notin$ clo($C$) and (b) $a \in$ clo($C$).


(a) Suppose $a \notin$ clo($C$). Then the result follows from Theorem 22.1, with clo($C$)
in the role of the set $Z$ there.

(b) Suppose $a \in$ clo($C$). Because $a \notin$ int($C$), there is a sequence $a^1, a^2, \ldots \in
\mathbb{R}^n \setminus$ clo($C$) converging to $a$. By Theorem 22.1 again, for each $k$ there is a
$p^k \in \mathbb{R}^n \setminus \{0\}$ with $p^k \cdot a^k < p^k \cdot c$ for all $c \in$ clo($C$), and we can take these vectors
$p^k$ such that $\|p^k\| = 1$ for every $k$ ($\|\cdot\|$ denotes the Euclidean norm). Because
$\{x \in \mathbb{R}^n \mid \|x\| = 1\}$ is a compact set, there exists a converging subsequence
$p^{k(1)}, p^{k(2)}, \ldots$ of $p^1, p^2, \ldots$ with limit, say, $p$. Then
$p \cdot a = \lim_{\ell \to \infty} p^{k(\ell)} \cdot a^{k(\ell)} \le \lim_{\ell \to \infty} p^{k(\ell)} \cdot c = p \cdot c$ for all $c \in$ clo($C$). ■


Theorem 22.14 (Krein-Milman) Let $C$ be a nonempty compact and convex subset
of $\mathbb{R}^n$. Then ext($C$) $\ne \emptyset$ and $C =$ conv(ext($C$)).


Proof 


(1) Because $C$ is compact and $x \mapsto \|x\|$ (where $\|\cdot\|$ denotes the Euclidean norm)
is continuous, there exists by the Theorem of Weierstrass an $e \in C$ with $\|e\| =
\max_{x \in C} \|x\|$. Then $e \in$ ext($C$), which can be proved as follows. Suppose that
$e = \frac{1}{2}(x^1 + x^2)$ for some $x^1, x^2 \in C$. Then

$$\|e\| = \left\|\tfrac{1}{2}(x^1 + x^2)\right\| \le \tfrac{1}{2}\|x^1\| + \tfrac{1}{2}\|x^2\| \le \tfrac{1}{2}\|e\| + \tfrac{1}{2}\|e\|$$

implies $\|x^1\| = \|x^2\| = \left\|\tfrac{1}{2}(x^1 + x^2)\right\|$. By definition of the Euclidean norm this
is only possible if $x^1 = x^2 = e$. This shows $e \in$ ext($C$). Hence, ext($C$) $\ne \emptyset$.

(2)  The  second  statement  in  the  theorem  will  be  proved  by  induction  on  dim(C). 

(a) If dim($C$) $= 0$, then $C = \{a\}$ for some $a \in \mathbb{R}^n$, so ext($C$) $= \{a\}$ and
conv(ext($C$)) $= \{a\} = C$.

(b) Let $k \in \mathbb{N}$, and suppose that conv(ext($D$)) $= D$ for every nonempty
compact and convex subset $D$ of $\mathbb{R}^n$ with dim($D$) $< k$. Let $C$ be a $k$-dimensional
compact convex subset of $\mathbb{R}^n$. Obviously, conv(ext($C$)) $\subseteq C$.
So it remains to prove: $C \subseteq$ conv(ext($C$)). Without loss of generality assume
$0 \in C$ (otherwise, shift the whole set $C$). Let $W$ be the smallest affine (hence,
linear) subset of $\mathbb{R}^n$ containing $C$. Hence, dim($W$) $= k$. From part (1) of the
proof there is an $e \in$ ext($C$). Let $x \in C$. If $x = e$ then $x \in$ conv(ext($C$)).
If $x \ne e$ then the intersection of the straight line through $x$ and $e$ with $C$ is a
line segment of which one of the endpoints is $e$. Let $b$ be the other endpoint.
Then $b$ is a boundary point of $C$. Then, by Lemma 22.13, there is a linear
function $f : W \to \mathbb{R}$ with $f(b) = \min\{f(c) \mid c \in C\}$ and $f \ne 0$ (check this).
Let $D := \{y \in C \mid f(y) = f(b)\}$. Then $D$ is a compact and convex subset of
$C$. Because $f \ne 0$ it follows that dim($D$) $< k$. By the induction hypothesis,
$D =$ conv(ext($D$)). Also, ext($D$) $\subseteq$ ext($C$), see Problem 22.8. Hence,
$b \in D =$ conv(ext($D$)) $\subseteq$ conv(ext($C$)). Further, $e \in$ ext($C$). Because
$x \in$ conv$\{b, e\}$ it follows that $x \in$ conv(ext($C$)). So $C \subseteq$ conv(ext($C$)). ■

Proof of Theorem 22.12 Because $D^{n \times n}$ is compact and convex, part (b) follows from
part (a) and Theorem 22.14. So only (a) still has to be proved.

(1) We first prove that $P^{n \times n} \subseteq$ ext($D^{n \times n}$). Let $P = [p_{ij}]_{i,j=1}^n$ be a permutation matrix
with $P = \frac{1}{2}(A + B)$ for some $A, B \in D^{n \times n}$. Then $p_{ij} = \frac{1}{2}(a_{ij} + b_{ij})$ and $p_{ij} \in \{0, 1\}$
for all $i, j \in \{1, 2, \ldots, n\}$. If $p_{ij} = 0$ then $a_{ij} = b_{ij} = 0$ because $a_{ij}, b_{ij} \ge 0$.
If $p_{ij} = 1$ then $a_{ij} = b_{ij} = 1$ because $a_{ij}, b_{ij} \le 1$. Hence, $A = B$, so that
$P \in$ ext($D^{n \times n}$).

(2) Let now $D = [d_{ij}] \in D^{n \times n}$ such that $D$ is not a permutation matrix. The proof is
complete if we show that $D$ is not an extreme point. For this, it is sufficient to
show that there exists an $n \times n$-matrix $C \ne [0]$ with

(i) $c_{ij} = 0$ whenever $d_{ij} = 0$ or $d_{ij} = 1$,

(ii) $\sum_{i=1}^n c_{ij} = 0$ for all $j \in \{1, 2, \ldots, n\}$ with $d_{ij} \ne 1$ for every $i$,

(iii) $\sum_{j=1}^n c_{ij} = 0$ for all $i \in \{1, 2, \ldots, n\}$ with $d_{ij} \ne 1$ for every $j$.

For in that case, for $\varepsilon > 0$ sufficiently small, the matrices $D + \varepsilon C$ and $D - \varepsilon C$
are two different doubly stochastic matrices with $D = \frac{1}{2}((D + \varepsilon C) + (D - \varepsilon C))$,
implying that $D \notin$ ext($D^{n \times n}$).

We are left to construct $C$. In order to satisfy (i), for those rows or columns of
$D$ that contain a 1 the corresponding rows or columns of $C$ contain only zeros.
Suppose there are $k$ rows (and hence columns) of $D$ that do not contain a 1.
Because $D$ is not a permutation matrix, $2 \le k \le n$. In these $k$ rows there are at
least $2k$ elements unequal to 0 and 1. The corresponding $2k$ or more elements
of $C$ are to be chosen such that they satisfy the system of $2k$ homogeneous
linear equations described in (ii) and (iii). Without loss of generality assume
that these equations correspond to the first $k$ rows and the first $k$ columns. Then,
if $\sum_{j=1}^k c_{ij} = 0$ for all $i \in \{1, \ldots, k - 1\}$ and $\sum_{i=1}^k c_{ij} = 0$ for all $j \in \{1, \ldots, k\}$,
we have $\sum_{j=1}^k c_{kj} = 0$ as well, so that this last equation is redundant. Thus, we
have a system with less than $2k$ independent equations and at least $2k$ variables.
Hence, it has a nonzero solution, which gives the required $C \ne [0]$. ■


22.7  The  Max-Flow  Min-Cut  Theorem 

A capacitated network is a triple $(V, E, k)$, where $V$ is a finite set containing at least
two distinguished elements $q, s \in V$ called source ($q$) and sink ($s$); $E$ is a subset of
$V \times V$ such that $v \neq w$, $v \neq s$, and $w \neq q$ for all $(v, w) \in E$; and $k : E \to \mathbb{R}_+$.
Elements of $V$ are called vertices and elements of $E$ are called edges. The number
$k(e)$ is the capacity of the edge $e$; if $e = (v, w)$ then $k(e)$ is interpreted as the
maximal amount that can flow from $v$ to $w$ through edge $e$. The source has only
outgoing and the sink only incoming edges. See Fig. 22.2 for an example.

[Fig. 22.2: A capacitated network]

A flow in this network is a map $f : E \to \mathbb{R}_+$ with $f(e) \le k(e)$ for all $e \in E$ and such that for
all $v \in V \setminus \{q, s\}$

$$\sum_{w \in V:\, (w,v) \in E} f(w, v) = \sum_{w \in V:\, (v,w) \in E} f(v, w).$$


In  other  words,  a  flow  satisfies  the  capacity  constraints  and  for  all  vertices  (except 
source  and  sink)  the  ‘inflow’  equals  the  ‘outflow’. 

The value of a flow $f$ is defined as the inflow in the sink, i.e., as the number

$$\sum_{v \in V:\, (v,s) \in E} f(v, s).$$
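
The definitions of a flow and its value can be checked mechanically. The following is a minimal sketch, assuming a network stored as a Python dictionary from edges to capacities; the function names `is_flow` and `flow_value` and the small example network are illustrative assumptions (the example is not the network of Fig. 22.2).

```python
from collections import defaultdict


def is_flow(capacity, flow, source, sink, tol=1e-9):
    """Return True if `flow` satisfies the capacity constraints and
    inflow equals outflow at every vertex other than source and sink."""
    inflow = defaultdict(float)
    outflow = defaultdict(float)
    for (v, w), cap in capacity.items():
        f = flow.get((v, w), 0.0)
        # Capacity constraint: 0 <= f(e) <= k(e).
        if f < -tol or f > cap + tol:
            return False
        outflow[v] += f
        inflow[w] += f
    vertices = {x for edge in capacity for x in edge}
    return all(abs(inflow[v] - outflow[v]) <= tol
               for v in vertices - {source, sink})


def flow_value(flow, sink):
    """Value of a flow: the total inflow in the sink."""
    return sum(f for (v, w), f in flow.items() if w == sink)


# Hypothetical example network with source "q" and sink "s".
capacity = {("q", "a"): 2, ("q", "b"): 2, ("a", "b"): 1,
            ("a", "s"): 1, ("b", "s"): 2}
flow = {("q", "a"): 2, ("q", "b"): 1, ("a", "b"): 1,
        ("a", "s"): 1, ("b", "s"): 2}

print(is_flow(capacity, flow, "q", "s"))
print(flow_value(flow, "s"))
```

In this example the inflow and outflow at both intermediate vertices equal 2, so the map is a flow, and its value is the inflow 1 + 2 in the sink.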


A flow is called maximal if it has maximal value among all possible flows.
Intuitively, the value of a maximal flow is determined by the ‘bottlenecks’ in the
network. In order to formalize this, define a cut in the network to be a partition of
$V$ into two subsets $V_1$ and $V_2$ such that $q \in V_1$ and $s \in V_2$. Such a cut is denoted by
$(V_1, V_2)$. The capacity of a cut is the number

$$k(V_1, V_2) := \sum_{v \in V_1,\, w \in V_2:\, (v,w) \in E} k(v, w),$$

i.e., the total capacity along edges going from $V_1$ to $V_2$. A cut is called minimal if it
has minimal capacity among all possible cuts.

In the example in Fig. 22.2, a minimal cut has only the sink in $V_2$, and its capacity
is equal to 3. Obviously, this is also the value of a maximal flow, but such a flow is
not unique.

Flows  and  cuts  are,  first  of  all,  related  as  described  in  the  following  lemma. 

Lemma 22.15 Let $f$ be a flow in the capacitated network $(V, E, k)$, and let $\varphi : E \to \mathbb{R}$
be an arbitrary function. Then:

(a) $\sum_{v \in V} \sum_{(w,v):\, (w,v) \in E} \varphi(w, v) = \sum_{v \in V} \sum_{(v,w):\, (v,w) \in E} \varphi(v, w)$.

(b) $\sum_{(q,v):\, (q,v) \in E} f(q, v) = \sum_{(v,s):\, (v,s) \in E} f(v, s)$.

(c) For every cut $(V_1, V_2)$ the value of the flow $f$ is equal to

$$\sum_{(v,w):\, (v,w) \in E,\, v \in V_1,\, w \in V_2} f(v, w) \;-\; \sum_{(v,w):\, (v,w) \in E,\, v \in V_2,\, w \in V_1} f(v, w).$$


Proof (a) follows because summation at both sides is taken over the same sets. Part
(a) moreover implies

$$\sum_{(q,v):\, (q,v) \in E} f(q, v) + \sum_{(v,w) \in E:\, v \neq q} f(v, w) = \sum_{(v,s):\, (v,s) \in E} f(v, s) + \sum_{(v,w) \in E:\, w \neq s} f(v, w),$$

which implies (b) because $\sum_{(v,w) \in E:\, v \neq q} f(v, w) = \sum_{(v,w) \in E:\, w \neq s} f(v, w)$ by definition
of a flow (‘inflow’ equals ‘outflow’ at every vertex that is not the source and not
the sink). For part (c), let $(V_1, V_2)$ be a cut of the network. Then

$$
\begin{aligned}
\sum_{(v,w) \in E:\, v \in V_1,\, w \in V_2} f(v, w)
&= \sum_{(v,w) \in E:\, v \in V_1} f(v, w) - \sum_{(v,w) \in E:\, v, w \in V_1} f(v, w) \\
&= \sum_{(v,w) \in E:\, v = q} f(v, w) + \sum_{(v,w) \in E:\, w \in V_1} f(v, w) - \sum_{(v,w) \in E:\, v, w \in V_1} f(v, w) \\
&= \sum_{(v,w) \in E:\, v = q} f(v, w) + \sum_{(v,w) \in E:\, v \in V_2,\, w \in V_1} f(v, w) \\
&= \sum_{(v,w) \in E:\, w = s} f(v, w) + \sum_{(v,w) \in E:\, v \in V_2,\, w \in V_1} f(v, w),
\end{aligned}
$$

where the second equality follows since everything that leaves from a node of $V_1$
also has entered that node, except for $q$; and the fourth equality follows from (b).
This implies part (c) of the lemma. ■
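
Part (c) of the lemma can be verified numerically on a small instance by enumerating all cuts. The sketch below is illustrative only: the helper names `net_flow_across` and `all_cuts` and the example flow are assumptions, not taken from the text.

```python
from itertools import combinations


def net_flow_across(flow, V1):
    """Flow on edges from V1 to its complement minus flow on edges back."""
    forward = sum(f for (v, w), f in flow.items() if v in V1 and w not in V1)
    backward = sum(f for (v, w), f in flow.items() if v not in V1 and w in V1)
    return forward - backward


def all_cuts(vertices, source, sink):
    """Yield every V1 containing the source and not the sink."""
    inner = [v for v in vertices if v not in (source, sink)]
    for r in range(len(inner) + 1):
        for subset in combinations(inner, r):
            yield {source, *subset}


# Hypothetical flow on a network with source "q" and sink "s".
flow = {("q", "a"): 2, ("q", "b"): 1, ("a", "b"): 1,
        ("a", "s"): 1, ("b", "s"): 2}
vertices = ["q", "a", "b", "s"]

value = sum(f for (v, w), f in flow.items() if w == "s")
for V1 in all_cuts(vertices, "q", "s"):
    # By part (c) of the lemma, each cut gives the same number: the value.
    print(sorted(V1), net_flow_across(flow, V1), value)
```

For the cut with $V_1 = \{q, b\}$ the forward flow is 4 and the backward flow along $(a, b)$ is 1, which again gives the value 3.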

The  following  theorem  is  the  famous  Max  Flow  Min  Cut  Theorem. 

Theorem 22.16 Let $(V, E, k)$ be a capacitated network. Then the value of a
maximal flow is equal to the capacity of a minimal cut.

Proof Let $f$ be a maximal flow. (Note that $f$ is an optimal solution of a feasible
bounded linear program, so that existence of $f$ is guaranteed.) Part (c) of
Lemma 22.15 implies that the value of any flow $f$ is smaller than or equal to the
capacity of any cut, since

$$\sum_{(v,w):\, (v,w) \in E,\, v \in V_1,\, w \in V_2} f(v, w) \;-\; \sum_{(v,w):\, (v,w) \in E,\, v \in V_2,\, w \in V_1} f(v, w) \;\le\; \sum_{(v,w):\, (v,w) \in E,\, v \in V_1,\, w \in V_2} k(v, w)$$

for any cut $(V_1, V_2)$, so that it is sufficient to find a cut of which the capacity is equal
to the value of $f$.

For points $v, w$ in the network define a path as a sequence of different non-directed
edges starting in $v$ and ending in $w$; ‘non-directed’ means that for any edge
$(x, y) \in E$, $(x, y)$ as well as $(y, x)$ may be used in this path. Such a path may be
described by a sequence $v = x_1, x_2, \ldots, x_m = w$ with $(x_i, x_{i+1}) \in E$ or $(x_{i+1}, x_i) \in E$
for every $i = 1, \ldots, m-1$. Call such a path non-satiated if for every $i = 1, \ldots, m-1$
it holds that $f(x_i, x_{i+1}) < k(x_i, x_{i+1})$ if $(x_i, x_{i+1}) \in E$, and $f(x_{i+1}, x_i) > 0$ if
$(x_{i+1}, x_i) \in E$. In other words, the flow is below capacity in edges that are traversed
in the ‘right’ way, and positive in edges that are traversed in the ‘wrong’ way.

Define $V_1$ to be the set of vertices $v$ for which there is a non-satiated path from
$q$ to $v$, together with the vertex $q$, and let $V_2$ be the complement of $V_1$ in $V$. Then
$s \in V_2$ because otherwise there would be a non-satiated path from $q$ to $s$, implying
that $f$ would not be maximal; the flow $f$ could be increased by increasing it in edges
on this path that are traversed in the right way and decreasing it in edges along the
path that are traversed in the wrong way, without violating the capacity constraints
or the inflow-outflow equalities. Hence $(V_1, V_2)$ is a cut in the network.

Let $(x, y) \in E$ with $x \in V_1$ and $y \in V_2$. Then $f(x, y) = k(x, y)$ because otherwise
there would be a non-satiated path from $q$ to a vertex in $V_2$. Similarly, $f(x', y') = 0$
whenever $(x', y') \in E$ with $x' \in V_2$ and $y' \in V_1$. By Lemma 22.15, part (c), the value
of the flow $f$ is equal to

$$\sum_{(v,w):\, (v,w) \in E,\, v \in V_1,\, w \in V_2} f(v, w) \;-\; \sum_{(v,w):\, (v,w) \in E,\, v \in V_2,\, w \in V_1} f(v, w),$$

hence to $\sum_{(v,w):\, (v,w) \in E,\, v \in V_1,\, w \in V_2} k(v, w)$, which is by definition the capacity of the cut
$(V_1, V_2)$. This completes the proof. ■


Observe  that  the  proof  of  Theorem  22.16  suggests  an  algorithm  to  determine  a 
maximal  flow,  by  starting  with  an  arbitrary  flow,  looking  for  a  non-satiated  path, 
and  improving  this  path.  By  finding  an  appropriate  cut,  maximality  of  a  flow  can  be 
checked.  Theorem  22.16  is  actually  a  (linear)  duality  result,  but  the  above  proof  is 
elementary. 
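
The procedure suggested by the proof can be sketched in code. The version below is a minimal sketch under stated assumptions: it starts from the zero flow, searches for a non-satiated path by breadth-first search (this particular search order is the Edmonds-Karp variant, a choice made here and not prescribed by the text), and reads off the cut $(V_1, V_2)$ once no such path to the sink exists. The function names and the example network are hypothetical.

```python
from collections import deque


def reachable_by_non_satiated_paths(capacity, flow, source):
    """Breadth-first search from the source along non-satiated steps.
    Returns a dict mapping each reached vertex to (previous vertex,
    edge used, direction), with direction +1 ('right') or -1 ('wrong')."""
    pred = {source: None}
    queue = deque([source])
    while queue:
        x = queue.popleft()
        for (v, w), cap in capacity.items():
            if v == x and w not in pred and flow[(v, w)] < cap:
                pred[w] = (x, (v, w), +1)   # below capacity, 'right' way
                queue.append(w)
            elif w == x and v not in pred and flow[(v, w)] > 0:
                pred[v] = (x, (v, w), -1)   # positive flow, 'wrong' way
                queue.append(v)
    return pred


def max_flow_min_cut(capacity, source, sink):
    """Return a maximal flow and the set V1 of the associated cut."""
    flow = {edge: 0 for edge in capacity}
    while True:
        pred = reachable_by_non_satiated_paths(capacity, flow, source)
        if sink not in pred:
            return flow, set(pred)          # V1 = vertices reached from q
        # Walk back from the sink to collect the steps of the path.
        steps, x = [], sink
        while pred[x] is not None:
            prev, edge, direction = pred[x]
            steps.append((edge, direction))
            x = prev
        # Largest change that keeps the flow feasible on every step.
        delta = min(capacity[e] - flow[e] if d == 1 else flow[e]
                    for e, d in steps)
        for e, d in steps:
            flow[e] += d * delta


# Hypothetical example network with source "q" and sink "s".
capacity = {("q", "a"): 2, ("q", "b"): 2, ("a", "b"): 1,
            ("a", "s"): 1, ("b", "s"): 2}
flow, V1 = max_flow_min_cut(capacity, "q", "s")
value = sum(f for (v, w), f in flow.items() if w == "s")
cut_capacity = sum(c for (v, w), c in capacity.items()
                   if v in V1 and w not in V1)
print(value, cut_capacity, sorted(V1))
```

On this example the search ends with only the sink outside $V_1$, and the value of the flow found equals the capacity of that cut, as Theorem 22.16 requires.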

22.8  Problems

22.1.  Convex  Sets 

Prove that a closed set $Z \subseteq \mathbb{R}^n$ is convex if and only if $\frac{1}{2}x + \frac{1}{2}y \in Z$ for all $x, y \in Z$.

22.2.  Proof  of  Lemma  22.3 

Prove  that  at  most  one  of  the  systems  in  Lemma  22.3  has  a  solution. 

22.3. Rank of $AA^T$

Let $A$ be an $m \times n$ matrix with rank $k$. Prove that the rank of $AA^T$ (where $A^T$ is the
transpose of $A$) is also equal to $k$.

22.4.  Proof  of  Lemma  22.5 

Prove  that  at  most  one  of  the  systems  in  Lemma  22.5  has  a  solution. 

22.5.  Proof  of  Lemma  22.7 
Prove  Lemma  22.7. 

22.6.  Extreme  Points 

Let $C$ be a convex set in a linear space $V$ and let $e \in C$. Prove that the following
three statements are equivalent.

(a) $e \in \mathrm{ext}(C)$.

(b) For all $0 < a < 1$ and all $x, y \in C$, if $x \neq y$ then $e \neq ax + (1 - a)y$.

(c) $C \setminus \{e\}$ is a convex set.

22.7.  Affine  Subspaces 

Prove that a subset $A$ of a linear space $V$ is affine if, and only if, with any two
different elements $x$ and $y$ of $A$, also the straight line through $x$ and $y$ is contained
in $A$.

22.8.  The  Set  of  Sup-points  of  a  Linear  Function  on  a  Convex  Set 

Let $f : \mathbb{R}^n \to \mathbb{R}$ be a linear function. Let $C$ be a convex subset of $\mathbb{R}^n$ and $a :=
\sup\{f(x) \mid x \in C\}$, $D := \{x \in C \mid f(x) = a\}$. Show that $D$ is convex and that
$\mathrm{ext}(D) \subseteq \mathrm{ext}(C)$.


22.9  Notes 

Theorem  22.10  is  due  to  Brouwer  (1912).  For  proofs,  see  also  e.g.  Scarf  (1973)  or 
Vohra  (2005). 

Theorem 22.11 is from Kakutani (1941). One way to prove this theorem is to
derive it from the Brouwer Fixed Point Theorem: see, e.g., Hildenbrand and Kirman
(1976).

A good source for results on convexity is Rockafellar (1970).

The Max Flow Min Cut Theorem (Theorem 22.16) is due to Ford and Fulkerson
(1956).


23  Review Problems for Part I

This chapter contains Review Problems to Chaps. 2-10, organized per chapter.


Chapter  2 

RP  1  Matrix  Games  (1) 

(a) Give the maximin rows and minimax columns in the following matrix game:

$$A = \begin{pmatrix} 0 & 1 & 1 \\ 2 & 2 & 0 \\ 0 & 0 & 2 \end{pmatrix}.$$

What  can  you  infer  from  this  about  the  value  of  the  game? 

(b)  Calculate  all  (mixed)  maximin  and  minimax  strategies  and  the  value  of  the 
following  matrix  game: 


(c) Answer the same questions as in (a) and (b) for the following game:

$$A = \begin{pmatrix} 1 & 0 & 2 & 3 \\ 4 & 2 & 1 & 3 \\ 3 & 1 & 3 & 3 \end{pmatrix}.$$

RP 2 Matrix Games (2)


(a) Consider the following matrix game:

$$A = \begin{pmatrix} 4 & 2 \\ 1 & 3 \\ 0 & 5 \\ 5 & 1 \end{pmatrix}.$$


Determine  all  maximin  rows  and  minimax  columns,  and  also  all  saddlepoint(s), 
if  any.  What  can  you  conclude  from  this  about  the  value  of  this  game? 

(b) Consider the six different $2 \times 2$-matrix games that can be obtained by choosing
two rows from $A$.

Determine  the  values  of  all  these  games.  Which  one  must  be  equal  to  the  value 
of  A?  (Give  an  argument  for  your  answer!) 

(c)  Determine  all  (possibly  mixed)  maximin  and  minimax  strategies  of  A.  [Hint: 
Use  your  answer  to  (b).] 

RP  3  Matrix  Games  (3) 


(a) Consider the following matrix game:

$$A = \begin{pmatrix} 8 & 16 & 12 \\ 9 & 8 & 4 \\ 12 & 4 & 16 \end{pmatrix}.$$


Determine  all  maximin  rows  and  minimax  columns,  and  also  all  saddlepoint(s), 
if  any.  What  can  you  conclude  from  this  about  the  value  of  this  game? 

(b)  Give  an  argument  why  player  1  (the  row  player)  will  never  put  probability  on 
the  second  row  in  a  maximin  strategy.  Give  also  an  argument  why,  consequently, 
player  2  will  not  put  probability  on  the  third  column  in  a  minimax  strategy. 

(c)  Determine  all  maximin  and  minimax  strategies  and  the  value  of  A.  [Hint:  Use 
part  (b).] 
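
For the first parts of RP 1 to RP 3, maximin rows, minimax columns, and saddlepoints in pure strategies can be found by comparing row minima and column maxima. The following is a minimal sketch, assuming a matrix game stored as a list of rows with payoffs to the row player; the function names are illustrative, and indices are 0-based, so index 0 is the first row or column. It is applied to the matrix of RP 3(a).

```python
def maximin_rows(A):
    """Largest row minimum and the (0-based) rows attaining it."""
    row_min = [min(row) for row in A]
    best = max(row_min)
    return best, [i for i, m in enumerate(row_min) if m == best]


def minimax_columns(A):
    """Smallest column maximum and the (0-based) columns attaining it."""
    col_max = [max(col) for col in zip(*A)]
    best = min(col_max)
    return best, [j for j, m in enumerate(col_max) if m == best]


def saddlepoints(A):
    """Entries that are minimal in their row and maximal in their column."""
    col_max = [max(col) for col in zip(*A)]
    return [(i, j) for i, row in enumerate(A) for j, a in enumerate(row)
            if a == min(row) and a == col_max[j]]


A = [[8, 16, 12],
     [9, 8, 4],
     [12, 4, 16]]

lower, rows = maximin_rows(A)
upper, cols = minimax_columns(A)
print(lower, rows)      # row minima are 8, 4, 4
print(upper, cols)      # column maxima are 12, 16, 16
print(saddlepoints(A))
```

The largest row minimum is a lower bound and the smallest column maximum an upper bound for the value of the game; when the two differ, there is no saddlepoint and mixed strategies are needed.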


Chapter 3

RP  4  Bimatrix  Games  (1) 

Consider the following bimatrix game:

$$
\begin{array}{c|ccc}
  & L & C & R \\ \hline
U & 0,0 & 1,1 & 2,2 \\
M & 3,1 & 0,0 & 0,1 \\
D & 1,1 & 0,0 & 1,0
\end{array}
$$


(a)  Simplify  this  game  by  iterated  elimination  of  strictly  dominated  strategies.  Each 
time  you  eliminate  a  pure  strategy,  say  by  which  pure  or  mixed  strategy  the 
eliminated  strategy  is  strictly  dominated  and  why. 

(b)  Find  all  Nash  equilibria  in  the  remaining  game  by  computing  the  best  reply 
functions  and  making  a  diagram. 

(c)  Determine  all  Nash  equilibria  in  the  original  game. 

RP  5  Bimatrix  Games  (2) 

(a) Consider the following bimatrix game:

$$(A, B) = \begin{pmatrix} x,1 & x,0 \\ 0,0 & 2,1 \end{pmatrix}.$$

For every $x \in \mathbb{R}$, determine all Nash equilibria (in mixed and pure strategies) of
this game.

(b) Consider the following bimatrix game:

$$
\begin{array}{c|cccc}
  & e & f & g & h \\ \hline
a & 5,0 & 0,1 & 0,2 & 2,10 \\
b & 1,7 & 3,2 & 2,0 & 3,1 \\
c & 3,0 & 0,3 & 4,5 & 4,0 \\
d & 0,2 & 1,1 & 1,3 & 7,7
\end{array}
$$

Show that strategy $f$ for player 2 is strictly dominated by a mixed strategy, but
not by a pure strategy. Which strategies survive iterated elimination of strictly
dominated strategies in this game? Find all Nash equilibria in pure and mixed
strategies of this game.

RP  6  Voting 

Two political parties, I and II, each have four votes that they can distribute over two
party-candidates each. A committee is to be elected, consisting of three members.
Each political party would like to see as many as possible of their own candidates
elected in the committee. Of the total of four candidates those three that have most
of the votes will be elected; in case of ties, tied candidates are drawn with equal
probabilities.

(a)  Model  this  situation  as  a  bimatrix  game  between  the  two  parties. 

(b)  Determine  all  (possibly  mixed)  Nash  equilibria  in  this  game. 


Chapter  4 

RP  7  A  Bimatrix  Game 

Consider the following bimatrix game, where $a$ can be any real number:

$$
\begin{array}{c|cc}
  & L & R \\ \hline
T & a,0 & 0,1 \\
B & 0,1 & a,0
\end{array}
$$

(a)  Determine  the  set  of  (mixed  and  pure)  Nash  equilibria  of  this  game  for  every 
value  of  a. 

Suppose  now  that  player  1  (the  row  player)  chooses  first,  and  that  player  2  (the 
column  player)  observes  this  move  and  chooses  next. 

(b) Write down the extensive form of this game, determine the associated strategic
form, and for every $a \in \mathbb{R}$ determine all subgame perfect equilibria (in pure
strategies) in this game.

RP  8  An  Ice-cream  Vendor  Game 

Three ice-cream vendors choose a location on the beach. This beach has the
following form:

[Figure: the beach, divided into four regions A, B, C, and D]

Each of the four regions has 300 customers. Each customer goes to the nearest
ice-cream vendor, but can only move vertically or horizontally (hence not diagonally).
In case of ties customers are distributed equally over vendors.

(a)  Suppose  the  three  vendors  simultaneously  and  independently  choose  one  of  the 
four  regions.  Determine  the  Nash  equilibrium  or  equilibria  of  this  game,  if  any. 

(b)  Suppose  vendor  1  chooses  first,  vendor  2  observes  this  and  chooses  next,  and 
vendor  3  observes  both  choices  and  chooses  last.  Determine  the  subgame  perfect 
Nash  equilibrium  or  equilibria  of  this  game,  if  any. 

RP  9  A  Repeated  Game 

Consider the three player game where player 1 chooses between $U$ and $D$,
player 2 between $L$ and $R$, and player 3 between $A$ and $B$. Choices are made
simultaneously and independently, and only pure strategies are considered. The
payoffs are given by:

$$
A:\quad
\begin{array}{c|cc}
  & L & R \\ \hline
U & 2,2,0 & 5,5,5 \\
D & 8,6,8 & 0,7,4
\end{array}
\qquad\qquad
B:\quad
\begin{array}{c|cc}
  & L & R \\ \hline
U & 4,4,1 & 4,2,8 \\
D & 0,2,9 & 4,2,5
\end{array}
$$

Hence,  if  player  3  plays  A  the  left  matrix  applies  and  if  player  3  plays  B  the  right 
matrix  applies.  The  triples  are  the  payoffs  for  players  1,  2,  and  3,  respectively. 

(a)  Find  the  (pure  strategy)  Nash  equilibria  of  this  game,  if  any. 

(b)  Consider  the  twice  repeated  game  with  as  payoffs  the  sums  of  the  payoffs  in 
the  two  periods.  Suppose  the  players  play  a  subgame  perfect  Nash  equilibrium. 
What  can  you  say  about  the  actions  chosen  in  the  second  period? 

(c) Consider again the twice repeated game of (b). Is there a subgame perfect Nash
equilibrium of the game in which $(U, R, A)$ is played in the first period? If not,
argue why not; otherwise, give a complete description of such an equilibrium.

(d) Answer the same question as in (c) for the combination $(U, R, B)$.

RP  10  Locating  a  Pub 

Two  pub  owners,  player  1  and  player  2,  must  choose  a  location  for  their  pubs  in  a 
long  street.  In  fact,  player  1  wants  to  open  two  new  pubs  in  the  street,  but  player  2 
only  one.  The  possible  locations  are  A,  B  and  C,  where  A  is  right  at  the  beginning  of 
the  street,  B  is  somewhere  in  between  A  and  C,  and  C  is  at  the  very  end.  Between  A 
and  B  there  are  200  potential  customers  (uniformly  distributed),  and  between  B  and 
C  there  are  300  potential  customers  (uniformly  distributed  as  well).  Every  customer 
will  always  go  to  the  pub  that  is  closest  to  his  house. 

The two pub owners have agreed on the following procedure. First, player 1 chooses
a location for his first pub. Next, player 2 observes player 1’s choice and chooses a
location for his pub. Player 2 must choose a different location than player 1. Finally,
player 1 obtains the remaining location for his second pub.

The  objective  for  both  pub  owners  is  to  maximize  the  number  of  customers. 

(a)  Formulate  this  situation  as  an  extensive  form  game.  How  many  strategies  does 
player  1  have?  And  player  2? 

(b)  Find  the  unique  subgame  perfect  equilibrium  for  this  game. 

Suppose  now  that  they  change  the  procedure  as  follows.  After  player  1  has  chosen 
the  location  for  his  first  pub,  players  1  and  2  simultaneously  choose  one  of  the  two 
remaining  locations.  If  they  happen  to  choose  the  same  location,  then  they  will  enter 
into  a  fierce  fight  and  the  overall  utility  for  both  pub  owners  is  0. 

(c) Formulate this situation as an extensive form game. How many strategies does
player 1 have? And player 2?

(d) Find a subgame perfect equilibrium in which player 1 starts by choosing A. Find
another  subgame  perfect  equilibrium  in  which  player  1  starts  by  choosing  B. 
Find  yet  another  subgame  perfect  equilibrium  in  which  player  1  starts  by 
choosing  C. 

RP 11 A Two-stage Game

Consider the two bimatrix games

$$
G_1 =
\begin{array}{c|cc}
  & L & R \\ \hline
U & 3,3 & 0,4 \\
D & 4,0 & 1,1
\end{array}
\qquad\qquad
G_2 =
\begin{array}{c|ccc}
  & X & Y & Z \\ \hline
T & 3,1 & 0,0 & 0,0 \\
M & 0,0 & 3,3 & 0,0 \\
B & 0,0 & 0,0 & 1,3
\end{array}
$$

(a) Compute all the pure strategy Nash equilibria in $G_1$ and all the pure strategy
Nash equilibria in $G_2$.

Now suppose that $G_1$ is played first, then the players learn the outcome of $G_1$ and
next play the game $G_2$. For each player the payoff is the sum of the payoffs of each
game separately.

(b) In this two-stage game, how many (pure) strategies does each player have?

(c) Is there a subgame perfect Nash equilibrium in which $(U, L)$ is played in $G_1$? If
so, then describe such an equilibrium. If not, explain why.

(d) Now suppose that the order of play is reversed: first $G_2$ is played, then $G_1$.
Determine all subgame perfect Nash equilibria of this two-stage game.


Chapter  5 

RP 12 Job Market Signaling¹

A  worker  (W)  has  private  information  about  his  level  of  ability.  With  probability  | 
he  is  a  high  type  (H)  and  with  probability  |  he  is  a  low  type  (L).  After  observing  his 
own  type,  the  worker  decides  whether  to  obtain  a  costly  education  (E)  or  not  (N); 
think  of  E  as  getting  a  degree.  The  firm  (F)  observes  the  worker’s  education  decision 
but  not  the  worker’s  ability.  The  firm  then  decides  whether  to  employ  the  worker  in 
an  important  managerial  job  (M)  or  in  a  much  less  important  clerical  job  (C).  The 
payoffs  are  as  follows.  If  the  worker  gets  job  M  then  he  receives  a  payoff  of  10  but 
has  to  subtract  the  cost  of  education  if  he  has  chosen  E:  this  is  4  for  a  worker  of  type 
H  but  7  for  a  worker  of  type  L.  If  the  worker  gets  job  C  then  he  receives  a  payoff  of 
4  minus  the  cost  of  education  if  he  has  chosen  E.  The  firm  has  a  payoff  of  10  if  it 
offers  the  job  M  to  a  H  type  worker  but  0  if  it  offers  M  to  an  L  type  worker;  and  it 
has  payoff  4  if  it  offers  job  C  to  any  type  worker. 


¹ From Watson (2002).

(a) Represent this game in extensive form.

(b)  Find  all  the  pure  strategy  Nash  equilibria  of  this  game,  if  any.  (You  may  but  do 
not  have  to  use  the  strategic  form.) 

(c)  Which  one(s)  is  (are)  perfect  Bayesian?  Pooling  or  separating?  Provide  the 
associated  beliefs  and  discuss  whether  or  not  the  intuitive  criterion  is  satisfied. 

RP  13  Second-hand  Cars  (1) 

We  consider  the  following  variation  on  the  ‘lemons’  (second  hand  car)  market.  A 
seller  wants  to  sell  his  car.  He  knows  the  quality,  which  can  be  good  or  bad  with 
equal  probabilities.  There  is  also  a  buyer,  who  does  not  know  the  quality.  The  seller 
has  the  choice  between  offering  a  guarantee  certificate  or  not,  next  the  buyer  has  the 
choice  between  buying  the  car  or  not,  having  observed  the  action  of  the  seller.  The 
car  has  a  fixed  price  of  15.  The  value  of  a  bad  or  a  good  car  for  the  buyer  equals  10  or 
20,  respectively  (some  time  after  the  sale,  the  buyer  will  discover  the  quality  of  the 
car).  The  car  does  not  have  any  value  for  the  seller.  Offering  a  guarantee  certificate 
represents  an  expected  transfer  from  the  seller  to  the  buyer  of  0  if  the  car  is  good, 
and  of  10  if  the  car  is  bad  (if  the  buyer  buys  the  car).  If  the  buyer  does  not  buy  the 
car  then  payoffs  are  0  to  both. 

(a)  Model  this  situation  as  a  game  in  extensive  form.  Is  this  a  signaling  game?  Why 
or  why  not? 

(b)  Determine  the  strategic  form  of  this  game,  and  all  Nash  equilibria  in  pure 
strategies,  if  any. 

(c)  Determine  all  perfect  Bayesian  Nash  equilibria.  Which  ones  are  pooling  or 
separating,  if  any? 

RP  14  Second-hand  Cars  (2) 

You  have  seen  a  nice-looking  second-hand  car,  and  you  are  considering  buying  it. 
The  problem  is  that  you  do  not  know  the  precise  value  of  the  car.  Suppose  that  there 
is  a  25  %  chance  that  the  value  of  the  car  is  2,000,  and  there  is  a  75  %  chance  that 
the  value  is  4,000.  You  are  aware  of  these  probabilities.  The  car  seller,  on  the  other 
hand, knows the precise value of the car. You and the car seller simultaneously name
a price, and a transaction only takes place if your price is higher than or equal to the
price named by the seller. In that case, the eventual price to be paid will be the
average of your price and the price named by the seller.

For  convenience,  assume  that  both  you  and  the  seller  can  only  name  prices  1,000, 
3,000  and  5,000.  If  you  buy  the  car,  your  utility  would  be  the  value  of  the  car  minus 
the  price  paid.  For  the  seller  the  utility  would  be  the  price  received  minus  the  value 
of  the  car.  If  no  transaction  takes  place,  the  utility  for  both  would  be  zero. 

(a)  Model  this  situation  as  a  game  with  incomplete  information. 

(b)  How  many  types  and  how  many  strategies  do  you  have?  What  about  the  seller? 

(c) One of your strategies is strictly dominated. Which one? Find a pure strategy, or
mixed strategy, for you that strictly dominates it.

(d) Show that there is no Nash equilibrium in which you choose a price of 3,000.

(e)  Find  the  Nash  equilibrium,  or  equilibria,  in  pure  strategies  of  this  game.  Will 
you  eventually  buy  the  car? 

RP  15  Signaling  Games 

Compute  the  strategic  form  and  all  the  pure  strategy  equilibria  in  the  games  in  (a) 
and  (b)  below.  Also  determine  all  perfect  Bayesian  equilibria  in  pure  strategies. 
Which  ones  are  pooling  or  separating,  if  any?  Give  the  corresponding  beliefs.  Apply 
the  intuitive  criterion. 

(a) [Figure: extensive form of a signaling game]

RP 16 A Game of Incomplete Information
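Consider the following two games $G_1$ and $G_2$.

$$
G_1 =
\begin{array}{c|ccc}
  & C & D & E \\ \hline
A & \ldots & \ldots & 2,3 \\
B & \ldots & \ldots & 2,3
\end{array}
\qquad\qquad
G_2 =
\begin{array}{c|ccc}
  & C & D & E \\ \hline
A & 1,2 & 0,1 & 3,0 \\
B & 3,3 & 2,4 & 0,0
\end{array}
$$

At the beginning, a chance move determines whether the payoffs are as in $G_1$ or
as in $G_2$. Both events happen with probability 0.5. Player 1 knows whether $G_1$ is
played or $G_2$, but player 2 does not know which of the two games is played. Finally,
both players simultaneously choose an action.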

(a)  Draw  the  extensive  form  representation  of  this  game.  (That  is,  draw  the  game 
tree.) 

(b)  Compute  the  strategic  form  of  this  game  with  incomplete  information,  and  find 
all  Nash  equilibria  in  pure  strategies. 

Suppose now that player 2 knows whether $G_1$ or $G_2$ is played, but that player 1 does
not know.

(c)  Find  all  Nash  equilibria  in  pure  strategies  of  this  new  game  with  incomplete 
information.  Use  reasoning  to  find  these  equilibria,  without  making  the  strategic 
form. 

RP 17 A Bayesian Game²

Two persons are involved in a dispute, and each person can either fight (F) or yield
(Y). Each person has a payoff of 0 from yielding, regardless of the other person’s
action, and a payoff of 1 from fighting if the other person yields. If both persons
fight, then the payoffs are $-1$ to person 1 and 1 to person 2. (This reflects the fact
that person 2 is strong.)

(a) Represent this game in bimatrix form and compute all the Nash equilibria (so,
in pure and mixed strategies).

Now assume that person 2 could also be weak, in which case the payoffs from both
persons fighting are equal to 1 for person 1 and $-1$ for person 2; all other payoffs
stay the same. Suppose that person 2 knows whether he is weak or strong, but person
1 only knows that person 2 is strong with probability $a$.

(b)  Represent  this  game  in  extensive  form. 

(c)  Determine  the  strategic  form  of  the  game  in  (b)  and  compute  all  Nash  equilibria 
in  pure  strategies,  if  any.  (Your  answer  may  depend  on  the  value  of  a,  so  you 
may  have  to  distinguish  cases.) 


² From Osborne (2004).

RP 18 Entry as a Signaling Game³

A  challenger  contests  an  incumbent’s  market.  The  challenger  is  strong  with 
probability  1/4  and  weak  with  probability  3/4;  it  knows  its  type,  but  the  incumbent 
does  not.  The  challenger  may  either  prepare  itself  for  battle,  or  remain  unprepared 
(it  does  not  have  the  option  of  staying  out).  The  incumbent  observes  whether  the 
challenger  is  prepared  or  not,  but  not  its  type,  and  chooses  whether  to  fight  or 
acquiesce.  An  unprepared  challenger’s  payoff  is  5  if  the  incumbent  acquiesces  to 
its  entry.  Preparations  cost  a  strong  challenger  1  unit  of  payoff  and  a  weak  one  3 
units,  and  fighting  entails  a  loss  of  2  units  for  each  type.  The  incumbent  prefers  to 
fight  (payoff  2)  rather  than  to  acquiesce  to  (payoff  0)  a  weak  challenger  (who  is 
quickly  dispensed  with),  and  prefers  to  acquiesce  to  (payoff  2)  rather  than  to  fight 
(payoff  0)  a  strong  one. 

(a)  Represent  this  game  in  extensive  form. 

(b)  Find  all  the  pure  strategy  Nash  equilibria  of  this  game,  if  any.  (You  may  but  do 
not  have  to  use  the  strategic  form.) 

(c)  Which  one(s)  is  (are)  perfect  Bayesian?  Pooling  or  separating?  Provide  the 
associated  beliefs  and  discuss  whether  or  not  the  intuitive  criterion  is  satisfied. 


Chapter  6 

RP  19  Bargaining  (1) 

Consider the following two-player bargaining game. Player 1 owns an object which
has worth 0 to him. The object has worth $v \in [0, 1]$ to player 2: player 2 knows $v$
but player 1 only knows that $v$ has been drawn according to the uniform distribution
over $[0, 1]$. There are two periods. In period 1, player 1 makes a price offer $p_1$ and
player 2 responds with “yes” or “no”. If player 2 rejects player 1’s offer, then player
2 makes the price offer $p_2$ in the second period, to which player 1 can say “yes” or
“no”. Agreement in the first period yields the payoff $p_1$ to player 1 and $v - p_1$ to
player 2. Agreement in the second period yields $\delta p_2$ to player 1 and $\delta(v - p_2)$ to
player 2 ($0 < \delta < 1$ is a discount factor). No agreement yields 0 to both.

(a)  Explain  that  this  is  a  game  of  incomplete  information.  What  are  the  type  sets  of 
the  players?  Describe  the  strategy  set  of  player  1. 

(b)  Describe  the  strategy  set  of  player  2. 

(c) Suppose player 1 offers a price $p_1$ in period 1. Give the best response of player
2 (assume that players 1 and 2 accept in case of indifference).

(d)  Compute  the  perfect  Bayesian  Nash  equilibrium  of  this  game. 

RP  20  Bargaining  (2) 

Suppose two players (bargainers) bargain over the division of one unit of a
perfectly divisible good. Player 1 has utility function $u_1(\alpha) = \alpha$ and player 2
has utility function $u_2(\beta) = 1 - (1 - \beta)^2$ for amounts $\alpha, \beta \in [0, 1]$ of the
good.

³ From Osborne (2004).

(a)  Determine  the  set  of  feasible  utility  pairs.  Make  a  picture. 

(b)  Determine  the  Nash  bargaining  solution  outcome,  in  terms  of  utilities  as  well  as 
of  the  physical  distribution  of  the  good. 

(c) Suppose the players’ utilities are discounted by a factor $\delta \in [0, 1)$. Calculate the
Rubinstein bargaining outcome.

(d) Determine the limit of the Rubinstein bargaining outcome, for $\delta$ approaching 1,
in two ways: by using the result of (b) and by using the result of (c).

RP  21  Bargaining  (3) 

Suppose two players (bargainers) bargain over the division of one unit of a perfectly
divisible good. Assume that utilities are just equal to the amounts of the good
obtained, discounted by a common discount factor $0 < \delta < 1$. The players play
a finite Rubinstein bargaining game over three periods $t = 0, 1, 2$, player 1 starting
to make an offer at time $t = 0$, player 2 making an offer at time $t = 1$ (if reached),
and the game ending at time $t = 2$ (if reached) with equal split $(\frac{1}{2}, \frac{1}{2})$.

(a)  Calculate  the  backwards  induction  outcome  of  this  game.  Argue  that  player  1 
has  a  beginner’s  advantage. 

Suppose again that two players (bargainers) bargain over the division of one unit of
a perfectly divisible good. Assume that player 1 has utility function $u(x)$ ($0 \le x \le 1$)
and player 2 has utility function $v(x) = 2u(x)$ ($0 \le x \le 1$).

(b) Determine the physical distribution of the good according to the Nash bargaining
solution. Can you say something about the resulting utilities?

RP  22  Ultimatum  Bargaining 

Consider the ultimatum bargaining game where player 1 offers a division $(1 - m, m)$
of one Euro, and player 2 can accept (receiving $m$ whereas player 1 receives $1 - m$) or
reject (in which case both players receive zero). Suppose that player 1 only cares about
how much money he receives, but player 2 also cares about the division: specifically,
if the game results in monetary payoffs $(x_1, x_2)$, then the (utility) payoff to player 2
is $x_2 + a(x_2 - x_1)$, where $a$ is some positive constant.

(a)  Represent  this  game  in  extensive  form,  writing  the  payoffs  in  terms  of  m  and  a. 

(b)  Determine  the  subgame  perfect  Nash  equilibrium. 

(c)  What  happens  to  the  equilibrium  monetary  split  as  a  becomes  large?  What  is  the 
explanation  for  this? 

RP  23  An  Auction  (1) 

Two players (bidders) have valuations $v_1 > 0$ and $v_2 > 0$ for an object. They
simultaneously and independently announce bids $b_1, b_2 \in [0, \infty)$. The player with
the highest bid obtains the object and has a payoff $v_i - b_i$, the other player has
payoff 0. If the bids are equal, then each obtains the object and associated payoff
with probability $\frac{1}{2}$.

(a)  Does  this  game  have  perfect  or  imperfect  information?  Complete  or  incomplete 
information? 

(b) Suppose that $v_1 = v_2$. Determine all Nash equilibria (in pure strategies), if any.

(c) Suppose that $v_1 > v_2$. Determine all Nash equilibria (in pure strategies), if any.

(d) Suppose that $v_1 = 1$ and $v_2 = 3$. Also suppose that only integer numbers up
to 3 are allowed as bids: $b_1, b_2 \in \{0, 1, 2, 3\}$. Represent this game as a bimatrix
game and solve for all pure Nash equilibria.

RP  24  An  Auction  (2) 

$n \ge 4$ players participate in an auction for an object for which their evaluations are
$v_1 > v_2 > \ldots > v_n$. It is a sealed-bid auction, and the highest bidder obtains the
object  and  pays  the  fourth-highest  bid.  In  case  of  a  tie  among  highest  bidders,  the 
player  with  the  lowest  number  among  the  highest  bidders  obtains  the  object. 

(a) Show that for any player $i$ the bid of $v_i$ weakly dominates any lower bid but does
not weakly dominate any higher bid.

(b)  Show  that  a  strategy  profile  in  which  each  player  bids  his  true  valuation  is  not  a 
Nash  equilibrium. 

(c)  Find  all  Nash  equilibria  in  which  all  players  submit  the  same  bid. 

RP  25  An  Auction  (3) 

Two  individuals  (players)  participate  in  the  auction  of  a  painting.  The  painting 
has  worth  6  for  player  1  and  4  for  player  2.  The  individuals  simultaneously  and 
independently submit their bids $b_1$ and $b_2$, where these bids can only be whole
numbers: $b_1, b_2 \in \{0, \ldots, 6\}$ (higher bids do not make sense). The highest bidder
wins the auction, receives the painting and pays his bid, whereas the other player
receives and pays nothing. In case of a draw player 1 wins. Hence, the payoff to
player 1 is $6 - b_1$ if $b_1 \ge b_2$ and 0 if $b_1 < b_2$, and the payoff to player 2 is $4 - b_2$ if
$b_2 > b_1$ and 0 if $b_2 \le b_1$.

(a)  Compute  the  best  reply  functions  of  both  players,  and  draw  a  diagram. 

(b)  Compute  the  Nash  equilibrium  or  equilibria  of  this  game,  if  any. 
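Because the bids are whole numbers between 0 and 6, the best replies and pure Nash equilibria of this auction can also be found by exhaustive enumeration. The sketch below is one way to do so, using the payoffs stated in the problem (player 1 wins ties); the names `payoffs`, `best_replies_1`, and `best_replies_2` are introduced here for illustration only.

```python
from itertools import product

BIDS = range(7)      # whole-number bids 0, ..., 6
WORTH = (6, 4)       # worth of the painting to players 1 and 2


def payoffs(b1, b2):
    """Payoffs of both players; player 1 wins in case of a draw."""
    if b1 >= b2:
        return WORTH[0] - b1, 0
    return 0, WORTH[1] - b2


def best_replies_1(b2):
    """Set of bids of player 1 that maximize his payoff against b2."""
    best = max(payoffs(b, b2)[0] for b in BIDS)
    return {b for b in BIDS if payoffs(b, b2)[0] == best}


def best_replies_2(b1):
    """Set of bids of player 2 that maximize his payoff against b1."""
    best = max(payoffs(b1, b)[1] for b in BIDS)
    return {b for b in BIDS if payoffs(b1, b)[1] == best}


# A pair of bids is a Nash equilibrium if each bid is a best reply
# against the other.
equilibria = [(b1, b2) for b1, b2 in product(BIDS, BIDS)
              if b1 in best_replies_1(b2) and b2 in best_replies_2(b1)]
print(equilibria)
```

The best reply sets computed this way can be used to check the diagram asked for in (a).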

RP  26  Quantity  Versus  Price  Competition 

Suppose, in the Cournot model, that the two firms produce heterogeneous goods,
which  have  different  market  prices.  Specifically,  suppose  that  these  market  prices 
are  given  by 

$p_1 = \max\{4 - 2q_1 - q_2, 0\}, \quad p_2 = \max\{4 - q_1 - 2q_2, 0\}$. (*)

These  are  the  prices  of  the  goods  of  firms  1  and  2,  respectively.  The  firms  compete 
in  quantities.  Both  fixed  and  marginal  costs  are  assumed  to  be  zero. 

(a) Write down the payoff (= profit) functions of the two firms and compute the
Nash equilibrium quantities.

(b)  Use  (*)  to  show  that 

$q_1 = \max\{\tfrac{1}{3}(p_2 - 2p_1 + 4), 0\}, \quad q_2 = \max\{\tfrac{1}{3}(p_1 - 2p_2 + 4), 0\}$. (**)

Assume  now  that  the  firms  compete  in  prices,  with  demands  given  by  (**). 

(c)  Write  down  the  payoff  (  =  profit)  functions  of  the  two  firms  in  terms  of  prices 
and  compute  the  Nash  equilibrium  prices. 

(d)  Compare  the  equilibria  found  under  (a)  and  (c).  Specifically,  compare  the 
quantities  and  (associated)  prices  under  (a)  to  the  (associated)  quantities  and 
prices  under  (c).  What  about  the  associated  profits?  Which  is  tougher:  quantity 
or  price  competition? 

RP  27  An  Oligopoly  Game  (1) 

Three  oligopolists  operate  in  a  market  with  inverse  demand  function  given  by 
$P(Q) = a - Q$, where $Q = q_1 + q_2 + q_3$ and $q_i$ is the quantity produced by firm $i$.
Each firm has a constant marginal cost of production, $0 < c < a$, and no fixed cost.
The firms choose their quantities as follows: (1) firm 1 chooses $q_1 \ge 0$; (2) firms 2
and 3 observe $q_1$ and then simultaneously choose $q_2$ and $q_3$, respectively.

(a)  Draw  a  picture  of  the  extensive  form  of  this  game.  Also  give  the  strategic  form: 
describe  the  strategy  spaces  of  the  players  and  the  associated  payoff  functions. 

(b)  Determine  the  subgame  perfect  Nash  equilibrium  of  this  game. 

RP  28  An  Oligopoly  Game  (2) 

Consider the Cournot model with three firms. Each firm $i = 1, 2, 3$ offers $q_i \ge 0$,
and the market price of the good is $P(q_1, q_2, q_3) = 10 - q_1 - q_2 - q_3$ (or zero if
this amount should be negative). Firms 1 and 2 have marginal costs equal to 0 while
firm 3 has marginal cost equal to 1. We assume that the firms are involved in Cournot
quantity competition.

(a)  Derive  the  reaction  functions  of  the  three  firms. 

(b)  Compute  the  Nash  equilibrium  of  this  game. 

(c) Determine the maximal joint profit the three firms can achieve by making an
agreement on the quantities $q_1$, $q_2$, and $q_3$. Is such an agreement unique?
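For parts (a) and (b), the reaction functions and their fixed point can be checked with exact arithmetic. The sketch below assumes an interior solution (positive price and positive quantities at the maximizer), which is an assumption made here and should be verified separately; the names `best_reply`, `total`, and `quantities` are illustrative.

```python
from fractions import Fraction

A = Fraction(10)                                  # intercept of the price function
COSTS = [Fraction(0), Fraction(0), Fraction(1)]   # marginal costs of firms 1, 2, 3


def best_reply(i, q):
    """Reaction of firm i (index 0, 1, 2) to the quantities of the others.

    Maximizes q_i * (A - q_i - others) - c_i * q_i, assuming that the
    price stays positive at the maximizer.
    """
    others = sum(q) - q[i]
    return max((A - COSTS[i] - others) / 2, Fraction(0))


# Each first-order condition reads A - c_i - q_i - Q = 0, with Q the
# total quantity. Adding them gives (n + 1) Q = n A - sum of costs,
# after which q_i = A - c_i - Q.
n = len(COSTS)
total = (n * A - sum(COSTS)) / (n + 1)
quantities = [A - c - total for c in COSTS]

# A Nash equilibrium is a fixed point of the reaction functions.
assert all(best_reply(i, quantities) == quantities[i] for i in range(n))
print(quantities, A - total)
```

The last line prints the candidate equilibrium quantities together with the resulting market price, so that the interior assumption can be confirmed.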

RP  29  A  Duopoly  Game  with  Price  Competition 

Two  firms  (1  and  2)  sell  one  and  the  same  good.  They  engage  in  price  competition 
à la Bertrand. The demand for the good at price $p$ is $100 - p$. The firm that sets the
lower price gains the whole market; in case of equal prices the market is split evenly.
The unit marginal cost for firm 1 is $c_1 = 30$, whereas for firm 2 it is $c_2 = 50$. Prices
are in whole units, i.e., $p \in \{0, 1, \ldots\}$. (So, e.g., a price of 20.5 is not allowed.)

(a)  Write  down  the  profit  functions  of  both  firms,  and  compute  the  monopoly  prices. 

(b)  Derive  the  reaction  functions  of  both  firms. 

(c)  Compute  the  Nash  equilibrium  in  which  the  price  of  firm  1  is  minimal  among 
all  Nash  equilibria. 

(d)  Compute  the  Nash  equilibrium  in  which  the  price  of  firm  1  is  maximal  among 
all  Nash  equilibria. 

RP  30  Contributing  to  a  Public  Good 

Three  people  simultaneously  decide  whether  or  not  to  contribute  to  a  public  good. 
At  least  two  contributions  are  needed  in  order  to  provide  the  public  good.  Suppose 
that  the  public  good  has  a  value  of  8  units  to  every  person  if  it  is  provided,  and 
that  the  contribution  is  fixed  at  3  units.  If  a  person  decides  to  contribute,  then  the 
contribution  must  be  paid  also  if  the  good  is  not  provided. 

(a)  Find  all  Nash  equilibria  in  pure  strategies  of  this  game. 

(b)  Suppose  now  that  players  2  and  3  use  the  same  mixed  strategy,  in  which 
they contribute with probability $p$. Show that player 1’s expected payoff of
contributing is equal to

$16p - 8p^2 - 3$.

(c)  Compute  the  two  symmetric  mixed  strategy  Nash  equilibria  of  this  game. 
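The expression in (b) and the indifference condition behind (c) can be verified numerically. The sketch below assumes, as in the problem, a value of 8 and a contribution of 3; the search over a grid of hundredths is only an illustrative shortcut made here, and solving the quadratic equation directly is the cleaner route.

```python
from fractions import Fraction

VALUE, COST = 8, 3


def payoff_contribute(p):
    # The good is provided if at least one of the other two contributes.
    return VALUE * (1 - (1 - p) ** 2) - COST


def payoff_abstain(p):
    # The good is provided only if both others contribute.
    return VALUE * p ** 2


# (b): the closed form 16p - 8p^2 - 3 agrees with the direct computation.
for k in range(11):
    p = Fraction(k, 10)
    assert payoff_contribute(p) == 16 * p - 8 * p ** 2 - 3

# (c): in a symmetric equilibrium with 0 < p < 1 each player must be
# indifferent between contributing and not contributing.
candidates = [Fraction(k, 100) for k in range(1, 100)]
indifferent = [p for p in candidates
               if payoff_contribute(p) == payoff_abstain(p)]
print(indifferent)
```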

RP  31  A  Demand  Game 

Three  players  divide  one  perfectly  divisible  Euro.  The  players  simultaneously 
submit their demands, $x_1$, $x_2$, and $x_3$, respectively, where $0 \le x_i \le 1$ for each
player $i = 1, 2, 3$. If the sum of these demands is smaller than or equal to 1, i.e.,
$x_1 + x_2 + x_3 \le 1$, then each player $i$ obtains his demand $x_i$; otherwise, i.e., if
$x_1 + x_2 + x_3 > 1$, then each player obtains 0.

First,  for  each  of  the  following  amounts,  either  exhibit  a  Nash  equilibrium  of 
this  game  with  sum  of  demands  equal  to  that  amount,  or  show  that  such  a  Nash 
equilibrium  does  not  exist. 

(a)  0.9;  (b)  1.2;  (c)  1.5;  (d)  1.8 

(e) Determine all Nash equilibria of this game, i.e., all triples $(x_1, x_2, x_3)$ with
$0 \le x_1, x_2, x_3 \le 1$ so that no player can do better given the demands of the other
players.

Chapter 7

RP  32  A  Repeated  Game  (1) 
Consider  the  following  bimatrix  game 


         L         R
T     16, 24     0, 25
B      0, 18    16, 16

(a) Which set of payoffs can be reached as the long run average payoffs in subgame
perfect Nash equilibria of the infinitely repeated discounted game $G^\infty(\delta)$ for
suitable choices of $\delta$?

(b)  Same  question  as  under  (a),  but  now  for  Nash  equilibrium  (not  necessarily 
subgame  perfect). 

(c) Describe a Nash equilibrium of $G^\infty(\delta)$ resulting in the long run average payoffs
(16, 20). Is there any value of $\delta$ for which this equilibrium is subgame perfect?
Why or why not?

RP  33  A  Repeated  Game  (2) 

Consider  the  following  stage  game  G: 

        L        C       R
U     8, 8     0, 9    4, 1
M     9, 1     2, 1    4, 2
D    10, 3     2, 4    4, 4


Player  1  is  the  row  player  and  player  2  the  column  player. 

(a)  Compute  all  the  pure  strategy  Nash  equilibria  of  G. 

(b) Compute all mixed-strategy Nash equilibria of the game G.

(c)  Suppose  that  the  game  G  is  played  twice,  with  as  payoffs  the  sums  of  the  payoffs 
of  the  two  stages.  Determine  the  number  of  (pure)  strategies  of  each  player.  Is 
there  a  subgame  perfect  (pure  strategy)  Nash  equilibrium  of  the  twice  repeated 
game in which (U, L) is played in the first stage? (Describe such an equilibrium
or  argue  that  it  cannot  exist.) 

(d) Consider the infinite repetition of the game G in which the players use the
discounted sums of payoffs to evaluate the outcome. The players have a
common discount factor $0 < \delta < 1$. Describe a subgame perfect Nash
equilibrium of the repeated game in which always (U, L) is played, using trigger
strategies. Also compute the minimal value of $\delta$ for which this equilibrium
exists.
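For trigger strategies, the discount factor needed to sustain a target outcome depends on which stage Nash equilibrium is used as punishment. The sketch below enumerates the pure Nash equilibria of the stage game of RP 33 and, for each of them, computes the smallest discount factor at which neither player gains from a one-shot deviation from the top-left cell (payoffs 8, 8). The payoff arrays are as read from the matrix of RP 33; the function names and the restriction to a single punishment equilibrium for both players are assumptions made here for illustration.

```python
from fractions import Fraction
from itertools import product

ROWS, COLS = "UMD", "LCR"
A = [[8, 0, 4], [9, 2, 4], [10, 2, 4]]   # payoffs of player 1 (rows)
B = [[8, 9, 1], [1, 1, 2], [3, 4, 4]]    # payoffs of player 2 (columns)


def pure_nash():
    """All pure Nash equilibria of the stage game, as index pairs."""
    return [(i, j) for i, j in product(range(3), range(3))
            if A[i][j] == max(A[k][j] for k in range(3))
            and B[i][j] == max(B[i][l] for l in range(3))]


def min_delta(target, punishment):
    """Smallest d such that no one-shot deviation from `target` pays
    when every deviation is followed by `punishment` forever.

    For each player: c / (1 - d) >= dev + d * pun / (1 - d),
    which rearranges to d >= (dev - c) / (dev - pun).
    """
    i, j = target
    pi, pj = punishment
    dev1 = max(A[k][j] for k in range(3))   # best deviation of player 1
    dev2 = max(B[i][l] for l in range(3))   # best deviation of player 2
    d1 = Fraction(dev1 - A[i][j], dev1 - A[pi][pj])
    d2 = Fraction(dev2 - B[i][j], dev2 - B[pi][pj])
    return max(d1, d2)


for i, j in pure_nash():
    print(ROWS[i], COLS[j], min_delta((0, 0), (i, j)))
```

Since the punishment is a stage Nash equilibrium played forever, the resulting trigger strategies are subgame perfect whenever the discount factor is at least the printed bound for the chosen punishment.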

RP 34 A Repeated Game (3)

Consider the following stage game G:

        L         C        R
U     0, 0     11, 0     6, 1
M     0, 12    10, 10    5, 5
D     1, 6     12, 4     6, 6

Player  1  is  the  row  player  and  player  2  the  column  player. 

(a)  Compute  all  the  pure  strategy  Nash  equilibria  of  G. 

(b) Compute all mixed-strategy Nash equilibria of the game G.

(c)  Suppose  that  the  game  G  is  played  twice,  with  as  payoffs  the  sums  of  the  payoffs 
of  the  two  stages.  Determine  the  number  of  (pure)  strategies  of  each  player.  Is 
there  a  subgame  perfect  (pure  strategy)  Nash  equilibrium  of  the  twice  repeated 
game  in  which  (M,  C)  is  played  in  the  first  stage?  (Describe  such  an  equilibrium 
or  argue  that  it  cannot  exist.) 

(d) Consider the infinite repetition of the game G in which the players use the
discounted sums of payoffs to evaluate the outcome. The players have a common
discount factor $0 < \delta < 1$. Describe a subgame perfect Nash equilibrium of the
repeated game in which always (M, C) is played, using trigger strategies. Also
compute the minimal value of $\delta$ for which this equilibrium exists.

RP  35  A  Repeated  Game  (4) 

Consider  the  bimatrix  game 


        L        R
T     4, 10    3, 9
B     5, 5     3, 4

Suppose  this  game  is  played  twice.  After  each  play  of  the  game  the  players  learn  the 
outcome.  The  total  payoff  is  the  sum  of  the  payoffs  of  each  of  the  two  plays  of  the 
game. 

(a)  Describe  the  subgame  perfect  equilibrium  or  equilibria  in  pure  strategies  of  this 
twice  repeated  game,  if  any.  Also  give  the  associated  outcome(s)  and  payoffs. 

(b)  Is  there  a  Nash  equilibrium  of  the  twice  repeated  game  in  which  the  combination 
(T,L)  is  played  the  first  time?  If  so,  describe  such  an  equilibrium  and  the 
associated  outcome  and  payoffs.  If  not,  explain  why. 

Now  assume  that  the  game  is  repeated  infinitely  many  times.  Payoffs  are  the 
discounted sums of stage payoffs, with common discount factor $0 < \delta < 1$.

(c) Which long run average payoffs can be obtained in a subgame perfect equilibrium
in (pure) trigger strategies of this infinitely repeated game for a suitably
chosen value of $\delta$? Give an example of such a subgame perfect equilibrium, and
also give the associated outcome, payoffs, and long-run average payoffs, as well
as the values of $\delta$ for which this is an equilibrium.

RP  36  A  Repeated  Game  (5) 

Consider  the  following  bimatrix  game: 


L  R 


(a)  Determine  all  Nash  equilibria  (in  pure  or  mixed  strategies)  in  this  game. 

Now suppose that the game is infinitely repeated, at times $t = 0, 1, 2, \ldots$, and
that the players learn the outcome after each play of the game. There is a common
discount factor $0 < \delta < 1$, and the payoffs are the discounted streams of payoffs.

(b) Which pairs of payoffs can be reached as long-run average payoffs in a subgame
perfect equilibrium (in trigger strategies) in this game, assuming that we can take
$\delta$ as close to 1 as desired?

(c)  Give  a  subgame  perfect  equilibrium  of  the  infinitely  repeated  game  resulting 
in  the  average  payoffs  (2|,  3).  Give  the  values  of  8  for  which  your  strategies 
indeed  form  a  subgame  perfect  equilibrium. 


Chapter  8 

RP  37  An  Evolutionary  Game 
Consider  the  following  matrix 


C  D 


(a)  Suppose  this  matrix  represents  an  evolutionary  game  between  animals  of  the 
same species. Give an interpretation of a mixed strategy $(p, 1 - p)$.

(b) Determine the replicator dynamics, rest points and stable rest points, and evolutionary
stable strategies. Include a phase diagram for the replicator dynamics.

(c)  For  the  evolutionary  stable  strategy  or  strategies,  show  directly  (that  is,  without 
using  the  replicator  dynamics)  that  the  strategy  (or  strategies)  is  (or  are) 
evolutionary  stable. 


Chapter  9 


RP  38  An  Apex  Game 

A voting committee consists of five members. Player 1 is a major player, called the
apex  player.  The  other  players  are  called  minor  players.  In  order  to  pass  a  decision 
one  needs  the  consent  of  either  the  apex  player  and  at  least  one  minor  player,  or  of 
the  four  minor  players.  Therefore,  a  coalition  that  contains  either  the  apex  player 
and  at  least  one  minor  player,  or  all  minor  players,  is  called  winning.  We  model  this 
situation  as  a  so-called  simple  game :  winning  coalitions  obtain  worth  1,  all  other 
coalitions  worth  0. 

(a)  Is  the  core  of  this  game  empty  or  not?  If  not,  then  compute  it. 

(b)  Compute  the  Shapley  value  of  this  game.  (Use  the  symmetry.) 

(c)  Compute  the  nucleolus  of  this  game.  (Use  the  symmetry.) 

RP  39  A  Three-person  Cooperative  Game  (1) 

A  three-person  cooperative  game  with  player  set  {1,2,3}  is  described  in  the 
following  table: 


S       ∅    {1}   {2}   {3}   {1,2}   {1,3}   {2,3}   {1,2,3}
v(S)    0     0     1     0      3       3       a       10


(a)  Determine  all  values  of  a  for  which  the  core  of  this  game  is  not  empty. 

(b)  Determine  the  Shapley  value  of  this  game.  For  which  values  of  a  is  it  in  the 
core? 

(c) Determine all values of $a$, if any, for which the vector
$\left(\frac{16 - 2a}{3}, \frac{7 + a}{3}, \frac{7 + a}{3}\right)$ is the
nucleolus of this game.

RP  40  A  Three-person  Cooperative  game  (2) 

For each $a \in \mathbb{R}$ the three-person cooperative game $(\{1, 2, 3\}, v_a)$ is given by
$v_a(1) = a$, $v_a(2) = v_a(3) = 0$, $v_a(12) = 2$, $v_a(13) = 3$, $v_a(23) = 4$, and
$v_a(N) = v_a(123) = 5$.

(a) For which values of $a$ is the core of $v_a$ non-empty? For these values, compute
the core.

(b) Compute the Shapley value of $v_a$. For which values of $a$ is it in the core of $v_a$?

(c) For which values of $a$ are the maximal excesses at the nucleolus of $v_a$ reached
by the two-person coalitions? For those values, compute the nucleolus.

RP  41  Voting 

Suppose  in  Parliament  there  are  four  parties  A,  B,C,D  with  numbers  of  votes  equal 
to  40,30,20, 10,  respectively.  To  pass  any  law  an  absolute  majority  (>50%)  is 
needed. 

(a)  Formulate  this  situation  as  a  four-person  cooperative  game  where  winning 
coalitions  (coalitions  that  have  an  absolute  majority)  have  worth  1  and  losing 
coalitions  worth  0.  Determine  the  Shapley  value  of  this  game. 
For every party $X$ in $\{A, B, C, D\}$ let $p_X$ denote the number of coalitions, containing
$X$, that are winning but would be losing without $X$. Define
$\beta(X) = p_X / (p_A + p_B + p_C + p_D)$ for every party $X$.

(b) Compute $\beta(X)$ for every $X \in \{A, B, C, D\}$.

Consider  now  a  Parliament  with  three  parties  A,  B,C  and  numbers  of  votes  equal  to 
20, 10, 10,  respectively.  To  pass  any  law  a  two-third  majority  is  needed. 

(c)  Answer  the  same  questions  as  in  (a)  and  (b)  for  this  Parliament.  (Of  course,  now 
$\beta(X) = p_X / (p_A + p_B + p_C)$ for $X \in \{A, B, C\}$.)
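For the four-party Parliament of part (a), the Shapley value can be computed by brute force as the average marginal contribution over all orders in which the parties may enter. The following is a minimal sketch of that computation; the names `worth` and `shapley` and the encoding of the majority requirement as "more than 50 votes out of 100" are choices made here for illustration.

```python
from fractions import Fraction
from itertools import permutations

VOTES = {"A": 40, "B": 30, "C": 20, "D": 10}
QUOTA = 50   # a coalition is winning if its votes exceed this number


def worth(coalition):
    """Worth 1 for a winning coalition and 0 for a losing one."""
    return 1 if sum(VOTES[x] for x in coalition) > QUOTA else 0


def shapley():
    """Average marginal contribution over all orders of the parties."""
    orders = list(permutations(VOTES))
    value = {x: Fraction(0) for x in VOTES}
    for order in orders:
        before = []
        for x in order:
            value[x] += worth(before + [x]) - worth(before)
            before.append(x)
    return {x: v / len(orders) for x, v in value.items()}


print(shapley())
```

Changing `VOTES` and `QUOTA` adapts the same sketch to the three-party Parliament of part (c), where the encoding of the two-thirds requirement is left to the reader.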

RP  42  An  Airport  Game 

Three  airline  companies  share  the  cost  of  a  runway.  To  serve  the  planes  of  company 
$i \in \{1, 2, 3\}$ the length of the runway must be $c_i$, which is also the cost of a runway
of  that  length.  The  airline  companies  can  form  coalitions,  and  the  cost  of  a  coalition 
is  the  cost  of  the  smallest  runway  long  enough  to  serve  the  planes  of  all  companies 
in the coalition. The costs $c_i$ are given by $c_i = i$ for each company $i \in \{1, 2, 3\}$, and
we assume $c_1 < c_2 < c_3$.

(a)  Model  this  situation  as  a  three-player  cost  savings  (TU)  game. 

(b)  Compute  the  core  of  this  game. 

(c)  Compute  the  Shapley  value  of  this  game.  Is  it  in  the  core? 

(d)  Suppose  that  at  the  nucleolus  of  this  game  the  excesses  of  the  two-person 
coalitions are equal and maximal. What does this imply for $c_1$, $c_2$, and $c_3$?

RP  43  A  Glove  Game 

There  are  five  players.  Players  1  and  2  each  possess  a  right-hand  glove,  while  players 
3,  4,  and  5  each  possess  a  left-hand  glove.  The  players  can  form  coalitions,  and  the 
worth  of  each  coalition  is  equal  to  the  number  of  pairs  of  gloves  that  the  coalition 
can  make. 

(a)  Compute  the  Shapley  value  of  this  game. 

(b)  Compute  the  core  of  this  game. 

(c)  Compute  the  nucleolus  of  this  game. 

RP  44  A  Four-person  Cooperative  Game 

Consider the four-person game $(N, v)$ with $N = \{1, 2, 3, 4\}$, $v(\{1, 2\}) = v(\{3, 4\}) = 2$,
$v(\{1, 3\}) = 3$, $v(N) = 4$, and $v(S) = 0$ for all other coalitions $S$.

(a)  Compute  the  core  of  this  game.  Plot  the  possible  core  payoffs  of  players  1  and 
3  in  a  two-dimensional  diagram  with  the  payoffs  of  player  1  on  the  horizontal 
axis  and  the  payoffs  of  player  3  on  the  vertical  axis. 

(b)  Compute  the  Shapley  value  of  this  game. 

RP 45 A Matching Problem

Let $X = \{x_1, x_2, x_3, x_4\}$ and $Y = \{y_1, y_2, y_3, y_4\}$ be two groups of people, and let
their  preferences  concerning  possible  partners  be  given  by  the  following  table. 


X\ 

x2 

X3 

X4 

yi 

y2 

ys 

y4 

ys 

ys 

y4 

x2 

X3 

x2 

X\ 

y4 

y2 

y  1 

yi 

X\ 

X4 

X\ 

X4 

y  1 

y4 

y2 

y2 

X3 

x2 

X3 

x2 

y2 

y  1 

y4 

X4 

X\ 

X4 

x3 

(a)  Apply  the  deferred  acceptance  procedure  with  proposals  by  members  of  X. 
Which  matching  do  you  obtain? 

(b)  Apply  the  deferred  acceptance  procedure  with  proposals  by  members  of  Y. 
Which  matching  do  you  obtain? 

(c) Explain why the matching $(x_1, y_1), (x_2, y_3), (x_3, y_2), (x_4, y_4)$ is not in the core.

(d)  How  many  matchings  are  in  the  core?  Explain  your  answer. 

RP  46  House  Exchange 

Player $i$ owns house $h_i$, $i = 1, 2, 3$. The preferences of the players over the houses
are  given  by  the  following  table: 


Player  1 

Player  2 

Player  3 

h2 

hi 

hi 

hi 

h2 

hi 

h3 

hi 

h 

(a)  Compute  all  core  allocations  of  this  game. 

(b)  Which  of  these  allocations  are  in  the  strong  core? 

(c)  Give  a  new  preference  of  player  1  such  that  the  new  game  has  a  unique  core 
allocation. 

RP  47  A  Marriage  Market 

Consider a marriage market with four men ($m_1$, $m_2$, $m_3$, and $m_4$) and four women
($w_1$, $w_2$, $w_3$, and $w_4$).


(a) Suppose that every woman has the same preference $m_1 > m_2 > m_3 > m_4$ over
the  men.  Argue  that  there  is  a  unique  matching  in  the  core,  and  describe  this 
matching. 

For (b)-(d) assume that the preferences are given by the following table.

m1    m2    m3    m4    w1    w2    w3    w4
w1    w2    w3    w4    m4    m3    m2    m1
w2    w1    w4    w3    m3    m4    m1    m2
w3    w4    w1    w2    m2    m1    m4    m3
w4    w3    w2    w1    m1    m2    m3    m4

(b)  Compute  the  core  matching  that  is  optimal  from  the  point  of  view  of  the  men. 

(c)  Compute  the  core  matching  that  is  optimal  from  the  point  of  view  of  the  women. 

(d) Find a core matching in which $m_1$ and $w_2$ are coupled.
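Parts (b)-(d) can be explored with a small implementation of the deferred acceptance procedure. The sketch below assumes that each column of the table for (b)-(d) lists a person's preferences from most to least preferred, and that a matching is in the core exactly when it has no blocking pair; the dictionaries `MEN` and `WOMEN` encode the table as read here, and all function names are illustrative.

```python
MEN = {"m1": ["w1", "w2", "w3", "w4"], "m2": ["w2", "w1", "w4", "w3"],
       "m3": ["w3", "w4", "w1", "w2"], "m4": ["w4", "w3", "w2", "w1"]}
WOMEN = {"w1": ["m4", "m3", "m2", "m1"], "w2": ["m3", "m4", "m1", "m2"],
         "w3": ["m2", "m1", "m4", "m3"], "w4": ["m1", "m2", "m3", "m4"]}


def deferred_acceptance(proposers, receivers):
    """Deferred acceptance with complete strict preference lists.

    Both arguments map each person to a list of partners, most
    preferred first. Returns a dict proposer -> receiver.
    """
    rank = {r: {p: k for k, p in enumerate(prefs)}
            for r, prefs in receivers.items()}
    next_index = {p: 0 for p in proposers}   # next receiver to propose to
    holds = {}                               # receiver -> proposer held
    free = list(proposers)
    while free:
        p = free.pop()
        r = proposers[p][next_index[p]]
        next_index[p] += 1
        current = holds.get(r)
        if current is None:
            holds[r] = p
        elif rank[r][p] < rank[r][current]:
            holds[r] = p              # r trades up and releases current
            free.append(current)
        else:
            free.append(p)            # r rejects p
    return {p: r for r, p in holds.items()}


def is_stable(matching, men, women):
    """True if the matching (man -> woman) has no blocking pair."""
    husband = {w: m for m, w in matching.items()}
    for m, prefs in men.items():
        for w in prefs:
            if w == matching[m]:
                break   # the remaining women are ranked below m's partner
            if women[w].index(m) < women[w].index(husband[w]):
                return False
    return True


men_propose = deferred_acceptance(MEN, WOMEN)
women_propose = deferred_acceptance(WOMEN, MEN)
candidate = {"m1": "w2", "m2": "w1", "m3": "w4", "m4": "w3"}
print(men_propose, women_propose, is_stable(candidate, MEN, WOMEN))
```

The `candidate` matching is only one matching that couples $m_1$ and $w_2$; the function `is_stable` can be used to test any other proposal for (d).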

Problems of Chapter 1

1.2  Variant  of  Matching  Pennies 

There  are  saddlepoint(s)  if  and  only  if  v  <  —  1 . 

1.3  Mixed  Strategies 

(b)  (3/4, 1/4). 

(c)  (1/2, 1/2). 

(d)  By  playing  (3/4,  1/4)  player  1  obtains  10/4  =  2.5  for  sure  (independent  of 
what  player  2  does).  Similarly,  by  playing  (1/2, 1/2),  player  2  is  sure  to  pay 
2.5.  So  2.5  is  the  value  of  this  game.  Given  a  rational  opponent,  no  player  can 
hope  to  do  better  by  playing  differently. 

1.4  Sequential  Cournot 

(b) $q_1 = 1/3$ and $q_2 = 1/6$.

1.6  Glove  Game 

(a)  (0,  0, 1)  is  the  unique  vector  in  the  core  of  the  glove  game. 

1.7  Dentist  Appointments 

The Shapley value $(9\frac{1}{2}, 6\frac{1}{2}, 8)$ is not in the core of this game. The nucleolus is in the
core of the game.

1.8 Nash Bargaining

(a) The problem to solve is $\max_{0 \le \alpha \le 1} \alpha\sqrt{1 - \alpha}$. Obviously, the solution must be
interior: $0 < \alpha < 1$. Set the first derivative equal to 0, solve, and check that the
second derivative is negative.

(b) In terms of utilities, the Nash bargaining solution is $(2/3, (1/3)\sqrt{3})$.
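The maximizer in (a) can be confirmed numerically without taking derivatives. The sketch below searches a grid for the maximum of $\alpha\sqrt{1 - \alpha}$ on $[0, 1]$; it compares the squared objective $\alpha^2(1 - \alpha)$ instead, which has the same maximizer on this interval and allows exact integer arithmetic. The grid size `N` and the helper `objective_squared` are illustrative choices made here.

```python
N = 3000   # grid size; alpha = k / N for k = 0, ..., N


def objective_squared(k):
    # (alpha * sqrt(1 - alpha))^2 = alpha^2 * (1 - alpha), multiplied
    # by N^3 so that only integers are compared.
    return k * k * (N - k)


best_k = max(range(N + 1), key=objective_squared)
alpha = best_k / N
print(alpha, alpha * (1 - alpha) ** 0.5)
```

On this grid the maximizer is the grid point $\alpha = 2/3$, in line with the utilities given in (b).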

1.9  Variant  of  Glove  Game 

The  worth  of  a  coalition  S  depends  on  the  minimum  of  the  numbers  of  right-hand 
and  left-hand  players  in  the  coalition. 


Problems  of  Chapter  2 

2.1  Solving  Matrix  Games 

(a) The optimal strategies are $(5/11, 6/11)$ for player 1 and $(5/11, 6/11)$ for player
2. The value of the game is 30/11. In the original game the optimal strategies
are $(5/11, 6/11, 0)$ for player 1 and $(5/11, 6/11, 0)$ for player 2.

(b) The value of the game is 0. The unique maximin strategy is $(0, 1, 0)$. The
minimax strategies are $(0, q, 1 - q, 0)$ for any $0 \le q \le 1$.

(c) The value of the game is 1, the unique minimax strategy is $(1/2, 0, 1/2)$, and
the maximin strategies are: $(p, (1 - p)/2, (1 - p)/2)$ for $0 \le p \le 1$.

(d) The value of the game is 9 and player 1’s maximin strategy is $(1/2, 1/2, 0, 0)$.
The set of all minimax strategies is
$\{(\alpha, (7 - 14\alpha)/10, (3 + 4\alpha)/10) \in \mathbb{R}^3 \mid 0 \le \alpha \le 1/2\}$.

(e) The value is 8/5. The unique maximin strategy is $(2/5, 3/5)$ and the unique
minimax strategy is $(0, 4/5, 1/5, 0)$.

(f) The value is equal to 1, player 2 has a unique minimax strategy, namely $(0, 1, 0)$,
and the set of maximin strategies is $\{(0, p, 1 - p) \mid 0 \le p \le 1\}$.

2.2  Saddlepoints 

(b)  There  are  saddlepoints  at  (1, 4)  and  at  (4, 1). 

2.3  Maximin  Rows  and  Minimax  Columns 

A  O 

(c)  The  unique  maximin  strategy  is  0, 0,  0)  and  the  unique  minimax  strategy  is 

(-  0  -) 

V7  ’  7 /• 


2.4  Subgames  of  Matrix  Games 

(c)  The  unique  minimax  strategy  is  (0, 4/5,  1/5, 0)  and  the  unique  maximin 
strategy  is  (2/5,  3/5) . 

2.5 Rock-Paper-Scissors
The  associated  matrix  game  is: 


R  P  S 


Problems  of  Chapter  3 

3.1  Some  Applications 

(a)  Let  Smith  be  the  row  player  and  Brown  the  column  player,  then  the  bimatrix 
game  is: 


         L          S
L      2, 2     -1, -1
S     -1, -1     1, 1

(b)  Let  the  government  be  the  row  player  and  the  pauper  the  column  player.  The 
bimatrix  game  is: 


         work      not
aid      3, 2     -1, 3
not     -1, 1      0, 0


(c)  This  game  has  two  pure  strategy  Nash  equilibria  and  one  other  (mixed 
strategy)  Nash  equilibrium. 

(e,  f)  This  situation  can  be  modelled  as  a  3  x  3  bimatrix  game.  For  (e),  the  expected 
numbers  of  candidates  in  the  committee  can  be  taken  as  payoffs;  for  (f),  the 
payoff is the expected utility (using $\sqrt{c}$) of the lottery determining the number
of  candidates  in  the  committee. 

3.2  Matrix  Games 

(a) You should find the same solution, namely $(5/11, 6/11)$ for player 1 and
$(5/11, 6/11)$ for player 2, as the unique Nash equilibrium.

(b) If player 2 plays a minimax strategy then 2’s payoff is at least $-v$, where $v$ is
the value of the game. Hence, any strategy that gives player 1 at least $v$ is a best
reply.  So  a  maximin  strategy  is  a  best  reply.  Similarly,  a  minimax  strategy  is  a 
best  reply  against  a  maximin  strategy,  so  any  pair  consisting  of  a  maximin  and 
a  minimax  strategy  is  a  Nash  equilibrium. 
Conversely, in a Nash equilibrium the payoffs must be $(v, -v)$, otherwise one of
the players could improve by playing an optimal (maximin or minimax) strategy.
But then player 1’s strategy must be a maximin strategy since otherwise player
2 would have a better reply, and player 2’s strategy must be a minimax strategy
since otherwise player 1 would have a better reply.

(c)  The  appropriate  definition  for  player  2  would  be:  a  maximin  strategy  for  player 
2  in  B ,  since  now  B  represents  the  payoffs  to  player  2,  and  not  what  player  2  has 
to  pay  to  player  1 . 

The  Nash  equilibrium  of  Problem  3.1(b),  for  instance,  does  not  consist  of 
maximin  strategies  of  the  players.  The  maximin  strategy  of  player  1  in  A  is 
(1/5,  4/5),  which  is  not  part  of  a  (the)  Nash  equilibrium.  The  maximin  strategy 
of  player  2  (!)  in  B  is  (1,0),  which  is  not  part  of  a  (the)  Nash  equilibrium. 

3.3  Strict  Domination 

(c)  There  are  three  Nash  equilibria:  ((1,0),  (1,0, 0,0)),  ((0, 1) ,  (0, 0,  1,0)),  and 
((3/7,  4/7),  (1/3,0, 2/3,0)). 

3.4  Iterated  Elimination  (1) 

(b)  The  unique  equilibrium  is  (B,  Y). 

3.5  Iterated  Elimination  (2) 

The  Nash  equilibria  are  ((1/3, 2/3, 0),  (2/3, 0, 1/3)),  ((0, 1, 0),  (1, 0,  0)),  and 

((1,0,0),  (0,0,1)). 

3.6  Weakly  Dominated  Strategies 

(b) Consecutive deletion of Z, C, A results in the Nash equilibria (B, X) and (B, Y).
Consecutive deletion of C, Y, B, Z results in the Nash equilibrium (A, X).

3.7  A  Parameter  Game 

Distinguish three cases: $a > 2$, $a = 2$, and $a < 2$.

3.8  Equalizing  Property  of  Mixed  Equilibrium  Strategies 

(a)  Check  by  substitution. 

(b) Suppose the expected payoff (computed by using $q^*$) of row $i$ played with
positive probability ($p_i^*$) in a Nash equilibrium $(p^*, q^*)$, hence the number
$e^i A q^*$, would not be maximal. Then player 1 would improve by adding the
probability $p_i^*$ to some row $j$ with higher expected payoff $e^j A q^* > e^i A q^*$, and in
this way increase his payoff, a contradiction. A similar argument can be made
for player 2 and the columns.

3.9 Voting

(a,  b,  c)  Set  the  total  number  of  voters  equal  to  10  (in  order  to  avoid  fractions).  We 
obtain  a  6  x  6  bimatrix  game  in  which  the  sum  of  the  payoffs  per  entry  is 
always  10.  This  game  has  four  Nash  equilibria  in  pure  strategies. 

(d)  Now  we  obtain  a  unique  Nash  equilibrium  in  pure  strategies. 

(e)  In  both  games,  subtract  5  from  all  payoffs. 

3.10  Guessing  Numbers 

(d)  The  unique  Nash  equilibrium  is  the  one  where  each  player  chooses  each  number 
with  equal  probability. 

(e)  The  value  of  this  game  is  ^ . 

3.11  Bimatrix  Games 

(b)  e  <  a,  b  <  d,  c  <  g,  h  <  f.  Then  there  is  a  unique  Nash  equilibrium. 


Problems  of  Chapter  4 

4.1  Counting  Strategies 

White  has  20  possible  opening  moves,  and  therefore  also  20  possible  strategies. 

Black  has  many  more  strategies. 

4.2  Extensive  Versus  Strategic  Form 

For  the  game  with  perfect  information,  start  with  a  decision  node  of  player  1 .  For 

the  game  with  imperfect  information,  start  with  a  decision  node  of  player  2. 

4.4  Choosing  Objects 

(c) In any subgame perfect equilibrium the game is played as follows: player 1 picks
$O_3$, then player 2 picks $O_2$ or $O_1$, and finally player 1 picks $O_4$. These are the
(two) subgame perfect equilibrium outcomes of the game. Due to ties (of player
2) there is more than one subgame perfect equilibrium, namely eight in total.
All subgame perfect equilibria result in the same distribution of the objects.

(d)  There  is  a  Nash  equilibrium  in  which  player  1  obtains  the  objects  O4  and  O2 . 

4.5  A  Bidding  Game 

(c)  Due  to  ties,  there  are  four  different  subgame  perfect  equilibria.  They  all  result 
in  the  same  outcome. 

4.6  An  Extensive  Form  Game 

There is a unique pure strategy Nash equilibrium, which is also subgame perfect.
This equilibrium is perfect Bayesian for an appropriate choice of player 2's belief.
4.7 Another Extensive Form Game

There is a unique Nash equilibrium (in pure strategies). This equilibrium is not
perfect Bayesian.

4.8  Still  Another  Extensive  Form  Game 

(b)  There  are  three  Nash  equilibria  in  pure  strategies. 

(c)  There  are  two  subgame  perfect  Nash  equilibria  in  pure  strategies. 

(d)  Both  equilibria  in  (c)  are  perfect  Bayesian. 

4.9  A  Centipede  Game 

(b)  Consider  any  strategy  combination.  The  last  player  who  has  continued  when 
playing  his  strategy  could  have  improved  by  stopping  if  possible.  Hence,  in 
equilibrium  the  play  of  the  game  must  have  stopped  immediately. 

To  exhibit  a  non-subgame  perfect  Nash  equilibrium,  assume  that  player  1  always 
stops,  and  that  player  2  also  always  stops  except  at  his  second  decision  node. 
Check  that  this  is  a  Nash  equilibrium.  [One  can  also  write  down  the  strategic 
form,  which  is  an  8  x  8  bimatrix  game.] 

4.10  Finitely  Repeated  Prisoners’  Dilemma 

(a)  There  are  five  subgames,  including  the  entire  game. 

(b)  There  is  a  unique  subgame  perfect  equilibrium. 

4.11  A  Twice  Repeated  2x2  Bimatrix  Game 

(b)  Player  1 :  play  B  at  the  first  stage;  if  ( B ,  L)  was  played  at  the  first  stage  play  B  at 
the  second  stage,  otherwise  play  T  at  the  second  stage. 

4.12  Twice  Repeated  3x3  Bimatrix  Games 

(a)  There  are  ten  subgames,  including  the  entire  game. 

(b) Player 1: play T at the first stage. Player 2: play L at the first stage. Second stage
play is given by the following diagram, in which rows and columns list the first stage
actions of players 1 and 2 and each entry gives the resulting second stage actions:

$$
\begin{array}{c|ccc}
 & L & M & R \\ \hline
T & B,R & C,R & C,R \\
C & B,M & B,R & B,R \\
B & B,M & B,R & B,R
\end{array}
$$

For instance, if first stage play results in (C, L), then player 1 plays B and player
2 plays M at stage 2. Verify that this defines a subgame perfect equilibrium in
which (T, L) is played at the first stage. (Other solutions are possible, as long
as players 1 and 2 are punished for unilateral deviations at stage 1.)
Problems of Chapter 5

5.1 Battle-of-the-Sexes

The  strategic  form  is  a  4  x  4  bimatrix  game.  List  the  strategies  of  the  players  as  in 
the  text.  We  can  then  compute  the  expected  payoffs.  For  example,  if  the  first  row 
corresponds  to  strategy  FF  of  player  1  and  strategies  FF,  FB ,  BF ,  and  BB  of  player 
2,  then  the  payoffs  are,  respectively,  1/6  times  (8,  3),  (6,  9),  (6, 0),  and  (4,6).  The 
(pure  strategy)  Nash  equilibria  are  (FF,  FB)  and  (BF,  BB). 

5.2  A  Static  Game  of  Incomplete  Information 

There  are  three  pure  Nash  equilibria:  (TT,  L),  (TB,  R),  and  (BB,  R).  (The  first  letter 
in  a  strategy  of  player  1  applies  to  Game  1,  the  second  letter  to  Game  2.) 

5.3  Another  Static  Game  of  Incomplete  Information 

(b) The unique pure strategy Nash equilibrium is: $t_1$ and $t_1'$ play B, $t_2$ and $t_2'$ play R.

5.4  Job-Market  Signaling 

(b)  There  is  a  separating  perfect  Bayesian  equilibrium.  There  is  another  Nash 
equilibrium  in  which  no  worker  takes  education,  but  this  is  not  perfect  Bayesian. 

5.5  A  Joint  Venture 

(c)  There  is  a  unique  Nash  equilibrium  (even  in  mixed  strategies).  This  is  also 
subgame  perfect  and  perfect  Bayesian. 

5.6  Entry  Deterrence 

For  v  <  100  the  strategy  combination  where  the  entrant  always  enters  and  the 
incumbent  colludes  is  a  perfect  Bayesian  equilibrium.  For  v  >  50,  the  combination 
where  the  entrant  always  stays  out  and  the  incumbent  fights  is  a  perfect  Bayesian 
equilibrium  if  the  incumbent  believes  that,  if  the  entrant  enters,  then  fighting  yields 
0  with  probability  at  most  1  —  — .  IC  applies  only  to  the  second  equilibrium,  which 

JC 

survives  it. 

5.7 The Beer-Quiche Game

(b)  There  are  two  perfect  Bayesian  equilibria,  both  of  which  are  pooling.  In  the  first 
one,  player  1  always  eats  quiche.  This  equilibrium  does  not  survive  IC.  In  the 
second  one,  player  1  always  drinks  beer.  This  equilibrium  does  survive  IC. 

5.8  Issuing  Stock 

(b)  There  is  a  pooling  equilibrium  in  which  the  manager  never  proposes  to  issue  new 
stock,  and  such  a  proposal  would  not  be  approved  of  by  the  existing  shareholders 
since  they  believe  that  this  proposal  signals  a  good  state  with  high  enough 
probability. [The background of this is that a new stock issue would dilute the
value  of  the  stock  of  the  existing  shareholders  in  a  good  state  of  the  world, 
see  the  original  article  Myers  and  Majluf  (1984)  for  details.]  This  equilibrium 
survives  the  intuitive  criterion. 

There  is  also  a  separating  equilibrium  in  which  a  stock  issue  is  proposed  in 
the  bad  state  but  not  in  the  good  state.  If  a  stock  issue  is  proposed,  then  it  is 
approved  of. 

Finally,  there  is  a  separating  equilibrium  in  which  a  stock  issue  is  proposed 
in  the  good  state  but  not  in  the  bad  state.  If  a  stock  issue  is  proposed,  then  it  is 
not  approved  of. 

(c)  In  this  case,  a  stock  issue  proposal  would  always  be  approved  of,  so  the 
‘bad  news  effect’  of  a  stock  issue  vanishes.  The  reason  is  that  the  investment 
opportunity  is  now  much  more  attractive. 

5.9  More  Signaling  Games 

(a)  IC  does  not  apply,  since  it  would  rule  out  both  types  of  player  1 . 

(b)  There  is  a  unique,  pooling  perfect  Bayesian  equilibrium.  This  equilibrium  does 
not  survive  the  intuitive  criterion. 

(c)  There  are  two  strategy  combinations  that  are  perfect  Bayesian. 


Problems  of  Chapter  6 

6.1  Cournot  with  Asymmetric  Costs 

The Nash equilibrium is $q_1 = (a - 2c_1 + c_2)/3$ and $q_2 = (a - 2c_2 + c_1)/3$, given
that these amounts are nonnegative.
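
The two quantities can be checked against each other as mutual best replies. The sketch below assumes the standard linear Cournot model (inverse demand $p = a - q_1 - q_2$ and constant marginal costs $c_1$, $c_2$), which this hint does not restate; the function names and the numbers are illustrative only.

```python
from fractions import Fraction as F

def best_reply(a, c_own, q_other):
    """Profit-maximizing quantity against q_other, assuming p = a - q1 - q2."""
    return max(F(0), (a - c_own - q_other) / 2)

def cournot_asymmetric(a, c1, c2):
    """Candidate equilibrium quantities for the asymmetric-cost duopoly."""
    return (a - 2 * c1 + c2) / 3, (a - 2 * c2 + c1) / 3

# Illustrative parameters (not taken from the text).
a, c1, c2 = F(10), F(1), F(2)
q1, q2 = cournot_asymmetric(a, c1, c2)
# In equilibrium each quantity is a best reply to the other.
print(q1 == best_reply(a, c1, q2), q2 == best_reply(a, c2, q1))
```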

6.2  Cournot  Oligopoly 


(b) The reaction function of player $i$ is: $\beta_i(q_1, \ldots, q_{i-1}, q_{i+1}, \ldots, q_n) = (a - c - \sum_{j \neq i} q_j)/2$
if $\sum_{j \neq i} q_j < a - c$, and $\beta_i(q_1, \ldots, q_{i-1}, q_{i+1}, \ldots, q_n) = 0$ otherwise.

(c) One should compute the point of intersection of the $n$ reaction functions. This
amounts to solving a system of $n$ linear equations in $n$ unknowns $q_1, \ldots, q_n$.
Alternatively, one may guess that there is a solution $q_1 = q_2 = \ldots = q_n$.
Then $q_1 = (a - c - (n - 1)q_1)/2$, resulting in $q_1 = (a - c)/(n + 1)$. Hence,
each firm producing $(a - c)/(n + 1)$ is a Nash equilibrium. If the number of
firms becomes large then this amount converges to 0, which is no surprise since
demand is bounded by $a$.

(d)  To  show  that  this  equilibrium  is  unique,  it  is  sufficient  to  show  that  the 
determinant  of  the  coefficient  matrix  associated  with  the  system  of  n  linear 
equations  in  n  unknowns  (the  reaction  functions)  is  unequal  to  zero. 
6.3 Quantity Competition with Heterogenous Goods

(a) $\Pi_i(q_1, q_2) = q_i p_i(q_1, q_2) - c q_i$ for $i = 1, 2$.

(b) The equilibrium is: $q_1 = (21 - 4c)/33$, $q_2 = (13 - 3c)/22$, $p_1 = (21 + 7c)/11$,
$p_2 = (39 + 13c)/22$. From this the profits are easily computed.

(c) $q_1 = (57 - 10c)/95$, $q_2 = (38 - 10c)/95$, $p_1 = (228 + 50c)/95$, $p_2 = (228 + 45c)/95$.
From this the profits are easily computed.

(d) $q_1 = \max\{1 - \frac{1}{2} p_1 + \frac{1}{3} p_2, 0\}$, $q_2 = \max\{1 - \frac{1}{2} p_2 + \frac{1}{4} p_1, 0\}$. The profit functions
are now $\Pi_1(p_1, p_2) = p_1 q_1 - c q_1$ and $\Pi_2(p_1, p_2) = p_2 q_2 - c q_2$, with $q_1$ and $q_2$
as given.

(e) The equilibrium is $p_1 = (16 + 8c)/11$, $p_2 = (30 + 15c)/22$. Note that these
prices are different from the ones in (c). Profits under price competition will be
lower than those under quantity competition.

(f)  These  are  the  same  prices  and  quantities  as  under  (c). 

(g)  See  the  answers  to  (e)  and  (f). 
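
The answers to (b) and (e) can be cross-checked with exact arithmetic. The sketch below assumes the demand system $q_1 = 1 - \frac{1}{2}p_1 + \frac{1}{3}p_2$, $q_2 = 1 - \frac{1}{2}p_2 + \frac{1}{4}p_1$ (interior case), inverts it to obtain prices as functions of quantities, and tests the first-order conditions at the stated equilibria. The function names and the cost value are illustrative.

```python
from fractions import Fraction as F

def quantities(p1, p2):
    """Demand system (interior case, assumed)."""
    return 1 - p1 / 2 + p2 / 3, 1 - p2 / 2 + p1 / 4

def prices(q1, q2):
    """Inverse demand obtained by solving the demand system for (p1, p2)."""
    return 5 - 3 * q1 - 2 * q2, F(9, 2) - F(3, 2) * q1 - 3 * q2

def cournot(c):
    """Quantity-competition equilibrium as stated in (b)."""
    return (21 - 4 * c) / 33, (13 - 3 * c) / 22

def bertrand(c):
    """Price-competition equilibrium as stated in (e)."""
    return (16 + 8 * c) / 11, (30 + 15 * c) / 22

def profits(p, q, c):
    """Profit (p_i - c) * q_i of each firm."""
    return tuple((pi - c) * qi for pi, qi in zip(p, q))

c = F(1, 2)  # illustrative marginal cost
q = cournot(c)
p = bertrand(c)
print(profits(prices(*q), q, c))      # quantity competition
print(profits(p, quantities(*p), c))  # price competition
```

For this cost level both firms earn less under price competition, in line with the remark in (e).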

6.4  A  Numerical  Example  of  Cournot  Competition  with  Incomplete  Information 

$q_1 = 18/48$, $q_H = 9/48$, $q_L = 15/48$. In the complete information case with
low cost we have $q_1 = q_2 = 16/48$, with high cost it is $q_1 = 20/48$ and $q_2 = 8/48$.
Note that the low cost firm 'suffers' from incomplete information since firm
1 attaches some positive probability to firm 2 having high cost and therefore has
higher supply. For the high cost firm the situation is reversed: it 'benefits' from
incomplete information.

6.5  Cournot  Competition  with  Two-Sided  Incomplete  Information 
Similar to (6.3) we derive:

$$
\begin{aligned}
q_\ell &= q_\ell(q_H, q_L) = \frac{a - c_\ell - \vartheta q_H - (1 - \vartheta) q_L}{2}, \\
q_h &= q_h(q_H, q_L) = \frac{a - c_h - \vartheta q_H - (1 - \vartheta) q_L}{2}, \\
q_L &= q_L(q_h, q_\ell) = \frac{a - c_L - \pi q_h - (1 - \pi) q_\ell}{2}, \\
q_H &= q_H(q_h, q_\ell) = \frac{a - c_H - \pi q_h - (1 - \pi) q_\ell}{2}.
\end{aligned}
$$

Here, $q_\ell$ and $q_h$ correspond to the low and high cost types of firm 1 and $q_L$ and
$q_H$ correspond to the low and high cost types of firm 2. The (Bayesian) Nash
equilibrium follows by solving these four equations in the four unknown quantities.
6.6 Incomplete Information About Demand

The equilibrium is: $q_1 = (\vartheta a_H + (1 - \vartheta) a_L - c)/3$, $q_H = (a_H - c)/3 + ((1 - \vartheta)/6)(a_H - a_L)$,
$q_L = (a_L - c)/3 - (\vartheta/6)(a_H - a_L)$. (Assume that all these quantities
are positive.)

6.7  Variations  on  Two-Person  Bertrand 

(a) If $c_1 < c_2$ then there is no Nash equilibrium. (Write down the reaction functions
or, easier, consider different cases.)

(b)  In  both  cases  (i)  and  (ii),  there  are  two  equilibria. 

6.8  Bertrand  with  More  Than  Two  Firms 

A strategy combination is a Nash equilibrium if and only if at least two firms charge
a price of $c$ and the other firms charge prices higher than $c$.

6.9  Variations  on  Stackelberg 

(a) With firm 1 as a leader we have $q_1 = (1/2)(a - 2c_1 + c_2)$ and $q_2 = (1/4)(a + 2c_1 - 3c_2)$.
With firm 2 as a leader we have $q_2 = (1/2)(a - 2c_2 + c_1)$ and
$q_1 = (1/4)(a + 2c_2 - 3c_1)$.

(b)  The  leader  in  the  Stackelberg  game  can  always  play  the  Cournot  quantity:  since 
the  follower  plays  the  best  reply,  this  results  in  the  Cournot  outcome.  Hence,  the 
Stackelberg  equilibrium — where  the  leader  maximizes — can  only  give  a  higher 
payoff.  (This  argument  holds  for  an  arbitrary  game  where  one  player  moves  first 
and  the  other  player  moves  next,  having  observed  the  move  of  the  first  player.) 

(c) $q_i = (1/2^i)(a - c)$ for $i = 1, 2, \ldots, n$.


6.10  First-Price  Sealed-Bid  Auction 

(b) Suppose that in some Nash equilibrium player $i$ wins with valuation $v_i < v_1$.
Then the winning bid $b_i$ must be at most $v_i$, otherwise player $i$ makes a negative
profit and therefore can improve by bidding (e.g.) $v_i$. But then player 1 can
improve by bidding higher than $b_i$ (and win) but lower than $v_1$ (and make
positive profit). Other Nash equilibria: $(v_1, v_1, 0, 0, \ldots, 0)$, $(b, b, b, \ldots, b)$ with
$v_1 > b > v_2$, etc.

(d) If not, then there would be a Nash equilibrium in which, in view of (c), all
players bid below their valuations. By (b) a player with the highest valuation
wins the auction, so this must be player 1 if each player bids below his true
valuation. But then player 1 can improve if $b_1 > v_2$ and player 2 can improve if
$b_1 < v_2$.

6.11  Second-Price  Sealed-Bid  Auction 

(d) Also $(v_1, 0, \ldots, 0)$ is a Nash equilibrium.

(e) The equilibria are: $\{(b_1, b_2) \mid b_2 > v_1,\ 0 < b_1 < v_2\} \cup \{(b_1, b_2) \mid b_1 > v_2,\ b_2 < b_1\}$.
6.12 Third-Price Sealed-Bid Auction

(b) Suppose $v_1 > v_2 > v_3 > \ldots$, then bidder 2 could improve by bidding higher
than $v_1$.

(c) Everybody bidding the highest valuation $v_1$ is a Nash equilibrium. Also
everybody bidding the second highest valuation $v_2$ is a Nash equilibrium. (There
are many more!)

6.13  n-Player  First-Price  Sealed-Bid  Auction  with  Incomplete  Information 
Suppose every player $j \neq i$ plays $s_j^*$. If player $i$'s type is $v_i$ and he bids $b_i$ (which can
be assumed to be at most $1 - 1/n$ since no other bidder bids higher than this) then the
probability of winning the auction is equal to the probability that every bid $b_j$, $j \neq i$, is
at most $b_i$ (including equality since this happens with zero probability). In turn, this
is equal to the probability that $v_j \le (n/(n-1)) b_i$ for every $j \neq i$. Since the players'
valuations are independently drawn from the uniform distribution, the probability
that player $i$ wins the auction is equal to $((n/(n-1)) b_i)^{n-1}$, hence player $i$ should
maximize the expression $(v_i - b_i)((n/(n-1)) b_i)^{n-1}$, resulting in $b_i = (1 - 1/n) v_i$.
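
The maximization step can be illustrated numerically. The following sketch evaluates the expression $(v_i - b_i)((n/(n-1)) b_i)^{n-1}$ on a grid of bids and picks the best one; the grid resolution and the example values are arbitrary choices, and the helper names are hypothetical.

```python
from fractions import Fraction as F

def expected_payoff(v, b, n):
    """(v - b) times the probability that all n - 1 rivals, each bidding (1 - 1/n) * v_j, bid at most b."""
    win_prob = (F(n, n - 1) * b) ** (n - 1)
    return (v - b) * win_prob

def best_bid_on_grid(v, n, steps=1000):
    """Grid search over bids in [0, 1 - 1/n]."""
    top = 1 - F(1, n)
    grid = [top * k / steps for k in range(steps + 1)]
    return max(grid, key=lambda b: expected_payoff(v, b, n))

# Example: n = 4 bidders, valuation 4/5; the formula (1 - 1/n) * v gives 3/5.
print(best_bid_on_grid(F(4, 5), 4))
```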

6.14  Double  Auction 

(b) The probability of trade given that $v_s < v_b$ is equal to $2x(1 - x)$. Note that this
is maximal for $x = 1/2$, and then it is equal to 1/2.

(c) $p_b(v_b) = (2/3) v_b + 1/12$ and $p_s(v_s) = (2/3) v_s + 1/4$.

(d) Observe that no trade occurs if $v_s > v_b$. Suppose $v_s < v_b$. Then the (conditional)
probability that trade occurs is 9/16. Observe that this is larger than the maximal
probability in (b).
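
The comparison between (b) and (d) can be approximated on a grid. The sketch below assumes valuations independently uniform on $[0, 1]$ and that trade occurs under the linear strategies of (c) whenever $p_b(v_b) \ge p_s(v_s)$; the midpoint grid and the function names are illustrative choices, so the result is only an approximation of 9/16.

```python
from fractions import Fraction as F

def one_price_trade_prob(x):
    """Part (b): probability of trade at the fixed price x, conditional on v_s < v_b."""
    return 2 * x * (1 - x)

def linear_trade_prob(cells=200):
    """Part (d): midpoint-grid estimate of P(trade | v_s < v_b) under the linear strategies."""
    trade = ordered = 0
    for i in range(cells):
        v_b = F(2 * i + 1, 2 * cells)
        for j in range(i):  # only grid cells with v_s < v_b
            v_s = F(2 * j + 1, 2 * cells)
            ordered += 1
            if F(2, 3) * v_b + F(1, 12) >= F(2, 3) * v_s + F(1, 4):
                trade += 1
    return trade / ordered

print(one_price_trade_prob(F(1, 2)), linear_trade_prob())
```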

6.15  Mixed  Strategies  and  Objective  Uncertainty 

(a)  ((1/2, 1/2),  (2/5,  3/5)). 

6.16  Variations  on  Finite  Horizon  Bargaining 

(a)  Adapt  Table  6.1  for  the  various  cases. 

(b) The subgame perfect equilibrium outcome is: player 1 proposes $(1 - \delta_2 + \delta_1 \delta_2,\ \delta_2 - \delta_1 \delta_2)$
at $t = 0$ and player 2 accepts.

(c)  The  subgame  perfect  equilibrium  outcome  in  shares  of  the  good  is:  player  1 
proposes  (1  —  8\  +  8\8\,  8\  —  8\8f)  at  t  —  0  and  player  2  accepts. 

(d) The subgame perfect equilibrium outcome is: player 1 proposes $(1 - \delta + \delta^2 - \ldots + \delta^{T-1} - \delta^T s_1,\ \delta - \delta^2 + \ldots - \delta^{T-1} + \delta^T s_1)$
at $t = 0$ and player 2 accepts.

(e) The limits are $(1/(1 + \delta), \delta/(1 + \delta))$, independent of $s$.
6.17 Variations on Infinite Horizon Bargaining

(a) Conditions (6.10) are replaced by $x_2^* = \delta_2 y_2^*$ and $y_1^* = \delta_1 x_1^*$. This implies
$x_1^* = (1 - \delta_2)/(1 - \delta_1 \delta_2)$ and $y_1^* = (\delta_1 - \delta_1 \delta_2)/(1 - \delta_1 \delta_2)$. In the strategies $(\sigma_1^*)$
and $(\sigma_2^*)$, replace $\delta$ by $\delta_1$ and $\delta_2$, respectively. The equilibrium outcome is that
player 1's proposal $x^*$ at $t = 0$ is accepted.

(b) Nothing essential changes. Player 2's proposal $y^*$ is accepted at $t = 0$.

(c) Nothing changes compared to the situation in the text, since $s$ is only obtained
at $t = \infty$.

(e) Let $p$ denote the probability that the game ends. Then $p$ is also the probability
that the game ends given that it does not end at $t = 0$. Hence, $p = (1 - \delta) + \delta p$,
so that $p = 1$.
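
For part (a), the two indifference conditions can be verified with exact fractions. This is a minimal sketch assuming the good has size 1, so that player 2's shares are $x_2^* = 1 - x_1^*$ and $y_2^* = 1 - y_1^*$; the discount factors in the example are arbitrary.

```python
from fractions import Fraction as F

def rubinstein_shares(d1, d2):
    """Proposals x* (by player 1) and y* (by player 2) with discount factors d1, d2."""
    x1 = (1 - d2) / (1 - d1 * d2)
    y1 = (d1 - d1 * d2) / (1 - d1 * d2)
    return (x1, 1 - x1), (y1, 1 - y1)

d1, d2 = F(9, 10), F(4, 5)  # illustrative discount factors
x, y = rubinstein_shares(d1, d2)
# Player 2 is indifferent between accepting x* and waiting for y*; similarly for player 1.
print(x[1] == d2 * y[1], y[0] == d1 * x[0])
```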

6.18 A Principal-Agent Game

(a)  This  is  a  game  of  complete  information. 

(b) The subgame perfect equilibrium can be found by backward induction. Distinguish
two cases: (i) strategy $h$ is optimal for the worker and (ii) strategy $l$
is optimal for the worker. Show that it is optimal for the employer to induce
high effort by a wage combination $(w_H, w_L)$ with $8 w_H + 2 w_L = 50$ and
$w_H - w_L > 5$.

6.19  The  Market  for  Lemons 

(b)  There  are  many  subgame  perfect  equilibria:  the  buyer  offers  p  <  5,000  and  the 
seller  accepts  any  price  of  at  least  5,000  if  the  car  is  bad  and  of  at  least  15,000 
if  the  car  is  good.  All  these  equilibria  result  in  expected  payoff  of  zero  for  both. 
There  are  no  other  subgame  perfect  equilibria. 

6.20  Corporate  Investment  and  Capital  Structure 

(b) Suppose the investor's belief that $\pi = L$ after observing $s$ is equal to $q$. Then
the investor accepts $s$ if and only if

$$ s[qL + (1 - q)H + R] \ge I(1 + r). \qquad (*) $$

The entrepreneur prefers to receive the investment if and only if

$$ s \le R/(\pi + R), \qquad (**) $$

for $\pi \in \{L, H\}$.

In a pooling equilibrium, $q = p$. Note that (**) is more difficult to satisfy for
$\pi = H$ than for $\pi = L$. Thus, (*) and (**) imply that a pooling equilibrium
exists only if

$$ \frac{I(1 + r)}{pL + (1 - p)H + R} \le \frac{R}{H + R}. $$

A separating equilibrium always exists. The low-profit type offers $s = I(1 + r)/(L + R)$,
which the investor accepts, and the high-profit type offers $s < I(1 + r)/(H + R)$,
which the investor rejects.

6.21  A  Poker  Game 

(a) The strategic form of this game is as follows:

$$
\begin{array}{c|cccc}
 & aa & aq & ka & kq \\ \hline
\text{believe} & -1,\ 1 & -1/3,\ 1/3 & -2/3,\ 2/3 & 0,\ 0 \\
\text{show} & 2/3,\ -2/3 & 1/3,\ -1/3 & 0,\ 0 & -1/3,\ 1/3
\end{array}
$$

Here, 'believe' and 'show' are the strategies of player I. The first letter in any
strategy of player II is what player II says if the dealt card is a King, the second
letter is what II says if the dealt card is a Queen. If the dealt card is an Ace,
player II has no choice.

(b) Player I has a unique optimal (maximin) strategy and player II has a unique
optimal (minimax) strategy. The value of the game is $-2/9$.
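
The value in (b) can be reproduced from the matrix in (a). The sketch below uses the fact that in a 2 x n zero-sum game the row player's guaranteed payoff is a piecewise linear, concave function of the probability $p$ on the first row, so the maximin is found at $p = 0$, $p = 1$, or at a crossing of two column lines. The function name is hypothetical and the method is a generic one, not necessarily the one intended in the text.

```python
from fractions import Fraction as F
from itertools import combinations

# Payoffs to player I; rows: believe, show; columns: aa, aq, ka, kq.
A = [[F(-1), F(-1, 3), F(-2, 3), F(0)],
     [F(2, 3), F(1, 3), F(0), F(-1, 3)]]

def solve_2xn(A):
    """Return (p, value): maximin probability on row 0 and the value of a 2 x n matrix game."""
    n = len(A[0])
    candidates = {F(0), F(1)}
    for j, k in combinations(range(n), 2):
        denom = (A[0][j] - A[1][j]) - (A[0][k] - A[1][k])
        if denom != 0:
            p = (A[1][k] - A[1][j]) / denom  # where column lines j and k cross
            if 0 <= p <= 1:
                candidates.add(p)

    def guaranteed(p):
        # Worst-case expected payoff over player II's pure strategies.
        return min(p * A[0][j] + (1 - p) * A[1][j] for j in range(n))

    best = max(candidates, key=guaranteed)
    return best, guaranteed(best)

print(solve_2xn(A))
```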

6.22  A  Hotelling  Location  Problem 

(a)  x\  =  v2  = 

(c)  X\  —  X2  — 

6.23  Median  Voting 

(a) The strategy set of each player is the interval [0, 30]. If each player $i$ plays $x_i$,
then the payoff to each player $i$ is $-|((x_1 + \ldots + x_n)/n) - t_i|$. A Nash equilibrium
always exists.

(b) The payoff to player $i$ is now $-|\mathrm{med}(x_1, \ldots, x_n) - t_i|$, where $\mathrm{med}(\cdot)$ denotes the
median. For each player, proposing a temperature different from his true ideal
temperature either leaves the median unchanged or moves the median farther
away from the ideal temperature, whatever the proposals of the other players.
Hence, proposing one's ideal temperature is a weakly dominant strategy.
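
The dominance argument in (b) can be checked by brute force on a small instance. The sketch below assumes three players and a coarse grid of temperatures in [0, 30]; it only tests that a truthful report is never worse than any other report, and contrasts the median rule with the average rule of (a). All names are illustrative.

```python
from itertools import product
from statistics import mean, median

GRID = range(0, 31, 5)  # coarse grid of temperatures in [0, 30]

def payoff(ideal, reports, rule):
    """Negative distance between the chosen temperature and the player's ideal."""
    return -abs(rule(reports) - ideal)

def truth_is_never_worse(rule, n=3):
    """True if reporting one's ideal is a best reply against every profile of the others."""
    for ideal in GRID:
        for others in product(GRID, repeat=n - 1):
            honest = payoff(ideal, (ideal,) + others, rule)
            for lie in GRID:
                if payoff(ideal, (lie,) + others, rule) > honest + 1e-12:
                    return False
    return True

print(truth_is_never_worse(median), truth_is_never_worse(mean))
```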

6.24  The  Uniform  Rule 

(b) $M = 4$: (1, 3/2, 3/2), $M = 5$: (1, 2, 2), $M = 5.5$: (1, 2, 5/2), $M = 6$:
(1, 2, 3), $M = 7$: (2, 2, 3), $M = 8$: (5/2, 5/2, 3), $M = 9$: (3, 3, 3).
(c) If player $i$ reports $t_i$ and receives $s_i > t_i$ then, apparently, the total reported
quantity is below $M$ and thus player $i$ can only further increase (hence, worsen)
his share by reporting a different quantity. If player $i$ reports $t_i$ and receives
$s_i < t_i$ then, apparently, the total reported quantity is above $M$ and thus player
$i$ can only further decrease (hence, worsen) his share by reporting a different
quantity.

There exist other Nash equilibria, but they do not give different outcomes
(shares). For example, if $M > \sum_{j=1}^{n} t_j$, then player 1 could just as well report 0
instead of $t_1$.
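
The shares listed in (b) can be reproduced with a direct implementation of the uniform rule. The sketch below assumes the reported peaks are (1, 2, 3), which is consistent with the outcome (1, 2, 3) for $M = 6$ but is not stated in this hint; the function name and the sequential procedure are illustrative choices.

```python
from fractions import Fraction as F

def uniform_rule(M, peaks):
    """Shares min(t_i, lam) if M <= sum of peaks, and max(t_i, lam) otherwise, with shares summing to M."""
    M = F(M)
    peaks = [F(t) for t in peaks]
    shortage = M <= sum(peaks)
    # Under shortage treat small peaks first; under surplus treat large peaks first.
    order = sorted(range(len(peaks)), key=lambda i: peaks[i], reverse=not shortage)
    shares = [None] * len(peaks)
    remaining = M
    for pos, i in enumerate(order):
        equal_split = remaining / (len(peaks) - pos)
        keeps_peak = peaks[i] <= equal_split if shortage else peaks[i] >= equal_split
        if keeps_peak:
            shares[i] = peaks[i]
            remaining -= peaks[i]
        else:
            # Everyone still unserved receives the same amount lam = equal_split.
            for j in order[pos:]:
                shares[j] = equal_split
            break
    return shares

for M in (4, 5, F(11, 2), 6, 7, 8, 9):
    print(M, uniform_rule(M, (1, 2, 3)))
```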

6.25  Reporting  a  Crime 

(b) $p = 1 - (c/v)^{1/(n-1)}$.

(c) The probability of the crime being reported in this equilibrium is $1 - (1 - p)^n = 1 - (c/v)^{n/(n-1)}$.
This converges to $1 - (c/v)$ for $n$ going to infinity. Observe that
both $p$ and the probability of the crime being reported decrease if $n$ becomes
larger.
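
The monotonicity claim in (c) is easy to see numerically. The sketch below evaluates the individual reporting probability $p = 1 - (c/v)^{1/(n-1)}$ and the overall probability $1 - (1 - p)^n$ for growing $n$; the values of $c$ and $v$ are arbitrary illustrations with $0 < c < v$.

```python
def report_probabilities(n, c, v):
    """Individual and overall reporting probabilities in the symmetric mixed equilibrium."""
    p = 1 - (c / v) ** (1 / (n - 1))
    overall = 1 - (1 - p) ** n
    return p, overall

for n in (2, 3, 5, 10, 100):
    print(n, report_probabilities(n, c=1.0, v=4.0))
```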

6.26  Firm  Concentration 

Let, in equilibrium, $n$ firms locate downtown and $m$ firms in the suburbs, with $n = 6$
and $m = 4$.

6.27  Tragedy  of  the  Commons 

(d) Suppose, to the contrary, $G^* < G^{**}$. Then $v(G^*) > v(G^{**})$ since $v' < 0$, and
$0 > v'(G^*) > v'(G^{**})$ since $v'' < 0$. Also, $G^*/n < G^{**}$. Hence

$$ v(G^*) + (1/n) G^* v'(G^*) - c > v(G^{**}) + G^{**} v'(G^{**}) - c, $$

a contradiction since both sides should be zero.


Problems  of  Chapter  7 

7.1 Nash and Subgame Perfect Equilibrium in a Repeated Game (1)

(a) $v(A) = 1$ and the minimax strategy in $A$ is R; $v(-B) = -1$ and the maximin
strategy in $-B$ is D.

(d) Player 1 always plays U but after a deviation switches to D forever. Player 2
always plays L but after a deviation switches to R forever. We need $\delta > 1/2$ to
make this a Nash equilibrium of $G^\infty(\delta)$.
7.2 Nash and Subgame Perfect Equilibrium in a Repeated Game (2)

(a) The limiting average payoffs (2, 1), (1, 2), and (2/3, 2/3), resulting from
playing, respectively, the Nash equilibria (U, L), (D, R), and ((2/3, 1/3),
(1/3, 2/3)) at every stage; and all payoffs $(x_1, x_2)$ with $x_1, x_2 > 2/3$.

(b) $v(A) = 2/3$ and $-v(-B) = 2/3$. Hence, all payoffs $(x_1, x_2)$ with $x_1, x_2 > 2/3$.

(c)  The  players  play  ( U ,  L)  at  even  times  and  ( D ,  R)  at  odd  times.  Since  at  each  time 
they  play  a  Nash  equilibrium  of  the  stage  game,  no  trigger  strategies  (describing 
punishment  after  a  deviation)  are  needed. 

(d)  In  this  case  a  trigger  strategy  is  needed.  The  players  alternate  between  ( U,L ), 
(D,  L),  and  ( D,R ). 

7.3  Nash  and  Subgame  Perfect  Equilibrium  in  a  Repeated  Game  (3) 

(a)  The  stage  game  has  a  unique  Nash  equilibrium. 

(b) $v(A) = 4$ since (D, L) is a saddlepoint in $A$. The minimax strategy of player 2 is
L. The value of $-B$ is $-1$ and the maximin strategy of player 1 is (1/2, 1/2).

(c)  These  limit  average  payoffs  are  obtained,  for  instance,  by  letting  the  players  play 
( U ,  L)  at  even  times  and  ( D ,  R)  at  odd  times.  After  any  deviation  the  players 
switch  to  playing  ( D ,  L)  (or  ((1/2,  1/ 2),  L))  forever. 

7.4  Subgame  Perfect  Equilibrium  in  a  Repeated  Game 

(c)  Alternate  between  (T,  L)  and  (M,  C). 

7.5  The  Strategies  Tr*  7r* 

An optimal moment for player 1 to deviate would be $t = 1$. For player 2 it would be
$t = 3$.

7.6  Repeated  Cournot  and  Bertrand 

(a) Each player offers half of the monopoly quantity (half of $(a - c)/2$) at each time,
but if a deviation from this occurs, then each player offers the Cournot quantity
$(a - c)/3$ forever. This is a subgame perfect equilibrium for $\delta > 9/17$.

(b) In this case, each player asks the monopoly price $(a + c)/2$ at each time; if
a deviation from this occurs, each player switches to the Bertrand equilibrium
price $p = c$ forever. This is a subgame perfect equilibrium for $\delta > 1/2$.
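
Both thresholds follow from the usual trigger-strategy inequality: colluding forever must be at least as good as deviating once and being punished forever after. The sketch below assumes inverse demand $p = a - q_1 - q_2$ and expresses all per-period profits in units of $(a - c)^2$; the profit figures are derived under that assumption and the function name is hypothetical.

```python
from fractions import Fraction as F

def critical_discount(collude, deviate, punish):
    """Smallest delta with collude/(1-delta) >= deviate + delta*punish/(1-delta)."""
    return (deviate - collude) / (deviate - punish)

# Cournot: share monopoly profit (1/8 each), best one-shot deviation 9/64, Cournot profit 1/9.
cournot_delta = critical_discount(F(1, 8), F(9, 64), F(1, 9))
# Bertrand: share monopoly profit (1/8 each), undercutting captures (almost) 1/4, then zero profit.
bertrand_delta = critical_discount(F(1, 8), F(1, 4), F(0))
print(cournot_delta, bertrand_delta)
```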

7.7  Repeated  Duopoly 

(b) The Nash equilibrium prices are $p_1 = p_2 = 6$.

(c) Joint profit is maximized at $p_1 = p_2 = 5$.

(d) Ask prices $p_1 = p_2 = 5$, but after a deviation switch to the equilibrium prices
$p_1 = p_2 = 6$. This is a subgame perfect equilibrium for $\delta > 25/49$.
7.8 On Discounting

See  the  solution  to  Problem  6.17(e). 

7.9  On  Limit  Average 

A sequence like 1, 3, 5, 7, ... has a limit average of infinity. More interestingly, one
may construct a sequence containing only the numbers +1 and −1 of which the
finite averages 'oscillate', e.g., below −1/2 and above +1/2, so that the limit does
not exist.
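
One possible construction of such a sequence is greedy: keep adding +1 until the running average exceeds +1/2, then keep adding −1 until it drops below −1/2, and so on. The sketch below is only one of many ways to do this, and the function names are illustrative.

```python
def oscillating_sequence(length):
    """Greedy +1/-1 sequence whose running average keeps crossing +1/2 and -1/2."""
    seq, total, sign = [], 0, 1
    while len(seq) < length:
        seq.append(sign)
        total += sign
        n = len(seq)
        if sign == 1 and 2 * total > n:       # average above +1/2: switch to -1
            sign = -1
        elif sign == -1 and 2 * total < -n:   # average below -1/2: switch to +1
            sign = 1
    return seq

def running_averages(seq):
    """Averages of the first n terms, for n = 1, ..., len(seq)."""
    total, out = 0, []
    for n, x in enumerate(seq, start=1):
        total += x
        out.append(total / n)
    return out

averages = running_averages(oscillating_sequence(5000))
print(max(averages[1000:]), min(averages[1000:]))
```

The blocks of equal signs grow roughly by a factor of three each time, so the averages keep crossing both thresholds no matter how far the sequence is continued.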


Problems  of  Chapter  8 

8.1  Symmetric  Games 

(a)  (0,  1)  is  the  only  ESS. 

(b)  Both  (1 , 0)  and  (0, 1)  are  ESS.  The  (Nash  equilibrium)  strategy  (1/3,  2/3)  is  not 
an  ESS. 

8.2  More  Symmetric  Games 

(a) The replicator dynamics is $\dot{p} = p(p - 1)(p - 1/2)$, with rest points $p = 0, 1, 1/2$,
of which only $p = 1/2$ is stable. The game $(A, A^T)$ has a unique symmetric Nash
equilibrium, namely ((1/2, 1/2), (1/2, 1/2)). The unique ESS is (1/2, 1/2).

8.3  Asymmetric  Games 

(b) The replicator dynamics is given by the equations $\dot{p} = pq(1 - p)$ and $\dot{q} = pq(1 - q)$.
There is one stable rest point, namely $p = q = 1$, corresponding
to the unique strict Nash equilibrium ((1, 0), (1, 0)) of the game. The other rest
points are all points in the set

$$ \{(p, q) \mid p = 0 \text{ and } 0 \le q \le 1, \text{ or } q = 0 \text{ and } 0 \le p \le 1\}. $$

8.4  More  Asymmetric  Games 

(a) The replicator dynamics are $dx/dt = x(1 - x)(2 - 3y)$ and $dy/dt = 2y(1 - 2x)(y - 1)$.
There are no stable rest points.

(b) The replicator dynamics are $dx/dt = x(x - 1)(2y - 1)$ and $dy/dt = y(y - 1)(2x - 1)$.

8.5  Frogs  Call  for  Mates 

Note  that  for  (a)  and  (b)  Proposition  8.5  can  be  used.  Similarly,  for  (c)  we  can  use 

Proposition  8.8,  by  stating  the  conditions  under  which  each  of  the  four  pure  strategy 

combinations  is  a  strict  Nash  equilibrium:  ifzi  <  P  m  —  1  and  zi  <  P  +  m  —  1 

then  (Call,  Call)  is  a  strict  Nash  equilibrium,  etc. 
8.6 Video Market Game

There  are  four  rest  points,  of  which  only  one  is  stable. 


Problems  of  Chapter  9 

9.2  Computing  the  Core 

(a) {(0, 0, 1)}; (b) polygon with vertices (15, 5, 4), (9, 5, 10), (14, 6, 4), and
(8, 6, 10).

9.4  The  Core  of  the  General  Glove  Game 
The  Shapley  value  is  in  the  core. 

9.6  Non-monotonicity  of  the  Core 

(b) The core of $(N, v')$ is the set {(0, 0, 1, 1)} [use the fact that $C(N, v') \subseteq C(N, v)$].
Hence, player 1 can only obtain less in the core although the worth of coalition
{1, 3, 4} has increased.

9.7  Efficiency  of  the  Shapley  Value 

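Consider an order $i_1, i_2, \ldots, i_n$ of the players. The sum of the coordinates of the
associated marginal vector is

$$
\begin{aligned}
& [v(\{i_1\}) - v(\emptyset)] \\
& + [v(\{i_1, i_2\}) - v(\{i_1\})] \\
& + [v(\{i_1, i_2, i_3\}) - v(\{i_1, i_2\})] \\
& + \ldots \\
& + [v(N) - v(N \setminus \{i_n\})] \\
& = v(N) - v(\emptyset) = v(N).
\end{aligned}
$$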

Hence,  every  marginal  vector  is  efficient,  so  the  Shapley  value  is  efficient  since  it  is 
the  average  of  the  marginal  vectors. 
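
The telescoping argument can be illustrated by computing all marginal vectors of a small game. The three-player game used below is a made-up example (coalitions containing player 3 are worth the square of their size, all others are worth 0), and the function names are illustrative.

```python
from fractions import Fraction as F
from itertools import permutations

def marginal_vector(order, v):
    """Marginal contributions of the players when they enter in the given order."""
    coalition, vec = frozenset(), {}
    for player in order:
        bigger = coalition | {player}
        vec[player] = v(bigger) - v(coalition)
        coalition = bigger
    return vec

def shapley_value(players, v):
    """Average of the marginal vectors over all orders of the players."""
    orders = list(permutations(players))
    total = {i: F(0) for i in players}
    for order in orders:
        for i, m in marginal_vector(order, v).items():
            total[i] += m
    return {i: total[i] / len(orders) for i in players}

# Hypothetical game, for illustration only.
v = lambda S: F(len(S) ** 2) if 3 in S else F(0)
phi = shapley_value((1, 2, 3), v)
print(phi, sum(phi.values()) == v(frozenset({1, 2, 3})))
```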

9.8  Computing  the  Shapley  Value 

(a) $\Phi(N, v) = (1/6, 1/6, 2/3) \notin C(N, v)$; (b) $(9\frac{1}{2}, 6\frac{1}{2}, 8)$, not in the core.

9.9  The  Shapley  Value  and  the  Core 

(a)  a  —  3  (use  Problem  9.5). 

(b)  (2.5,2, 1.5). 

(c)  The  Shapley  value  is  (a/ 3  +  1/2,  a/3,  a/3  —  1/2).  The  minimal  value  of  a  for 
which  this  is  in  the  core  is  15/4. 
9.10 Shapley Value in a Two-Player Game

$\Phi(N, v) = (v(\{1\}) + (v(\{1, 2\}) - v(\{1\}) - v(\{2\}))/2,\ v(\{2\}) + (v(\{1, 2\}) - v(\{1\}) - v(\{2\}))/2)$.

9.11  Computing  the  Nucleolus 

(a)  (0,0,1). 

(b)  (11.5,5.5,7). 

(c) $(1/5, 1/5, 1/5, 1/5, 1/5, 0, \ldots, 0) \in \mathbb{R}^{15}$.

(d) In $(N, v)$: (1/2, 1/2, 1/2, 1/2); in $(N, v')$: (0, 0, 1, 1).

9.12  Nucleolus  of  Two-Player  Games 

The nucleolus is $(v(\{1\}) + (v(\{1, 2\}) - v(\{1\}) - v(\{2\}))/2,\ v(\{2\}) + (v(\{1, 2\}) - v(\{1\}) - v(\{2\}))/2)$.

9.13  Computing  the  Core,  the  Shapley  Value,  and  the  Nucleolus 

(a)  The  nucleolus  and  Shapley  value  coincide  and  are  equal  to  (1.5, 2, 2.5). 

(c) The maximal value of $v(\{1\})$ is 2. For that value the core is the line segment
with endpoints (2, 1, 3) and (2, 3, 1).

9.14  Voting  (1) 

(a) The Shapley value is $\Phi(N, v) = (7/12, 3/12, 1/12, 1/12)$.

(b)  The  nucleolus  is  (1 , 0, 0,  0). 

9.15  Voting  (2) 

(c) $\Phi(N, v) = (1/60)(9, 9, 14, 14, 14)$.

(d)  The  nucleolus  is  (0,  0, 1/3, 1/3, 1/3). 

(e) The nucleolus is not in the core (e.g., $v(\{1, 3, 4\}) = 1 > 2/3$), so the core must
be empty. This can also be seen directly.

9.16  Two  Buyers  and  a  Seller 

(c) $\Phi(N, v) = (1/6, 4/6, 7/6)$.

(d)  The  nucleolus  is  (0, 1  / 2,  3/2). 

9.17  Properties  of  the  Shapley  Value 

(a) In $\Phi_i(N, v)$ the term $v(S \cup \{i\}) - v(S)$ occurs the same number of times as the
term $v(S \cup \{j\}) - v(S)$ in $\Phi_j(N, v)$, for every coalition $S \subseteq N \setminus \{i, j\}$. Let $S$ be a
coalition with $i \in S$ and $j \notin S$. Then $v((S \setminus \{i\}) \cup \{j\}) = v((S \setminus \{i\}) \cup \{i\})$, so that

$$
\begin{aligned}
v(S \cup \{j\}) - v(S) &= v(((S \setminus \{i\}) \cup \{j\}) \cup \{i\}) - v((S \setminus \{i\}) \cup \{i\}) \\
&= v(((S \setminus \{i\}) \cup \{j\}) \cup \{i\}) - v((S \setminus \{i\}) \cup \{j\}),
\end{aligned}
$$

and also these expressions occur the same number of times. Similarly for
coalitions $S$ that contain $j$ but not $i$.

(b)  This  is  obvious  from  Definition  9.4. 

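(c) Observe that it is sufficient to show $\sum_{S: i \notin S} \frac{|S|!(n - |S| - 1)!}{n!} = 1$. To show this, note that
$\frac{|S|!(n - |S| - 1)!}{n!} = \frac{1}{n} \binom{n-1}{|S|}^{-1}$, so that

$$
\sum_{S: i \notin S} \frac{|S|!(n - |S| - 1)!}{n!}
= \frac{1}{n} \sum_{s = 0, 1, \ldots, n-1} \binom{n-1}{s} \binom{n-1}{s}^{-1}
= \frac{1}{n} \cdot n = 1.
$$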

Problems of Chapter 10

10.1  A  Division  Problem  (1) 

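(b) In terms of utilities: $(\frac{1}{3}\sqrt{3}, \frac{2}{3})$, in terms of distribution: $(\frac{1}{3}\sqrt{3}, 1 - \frac{1}{3}\sqrt{3})$.

(c) The Rubinstein outcome is $x^*$ where $x_1^* = \sqrt{\frac{1}{1 + \delta + \delta^2}}$ and $x_2^* = 1 - \frac{1}{1 + \delta + \delta^2}$.

(d) $\lim_{\delta \to 1} x_1^* = \frac{1}{3}\sqrt{3}$, consistent with what was found under (a).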

10.2  A  Division  Problem  (2) 

Use  symmetry,  Pareto  optimality  and  covariance  of  the  Nash  bargaining  solution. 

10.3  A  Division  Problem  (3) 

(a)  The  distribution  of  the  good  is  2  —  In  utility  terms  this  is 

f  1—  S3  o  1—  s3  1 

ll-54’VZ  1 S4  I' 

(b) By taking the limit for $\delta \to 1$ in (a), we obtain (1.5, 0.5) as the distribution
assigned by the Nash bargaining solution. In utilities: $(0.75, \sqrt[3]{0.5})$.

10.4  An  Exchange  Economy 

(a) $x_1^A(p_1, p_2) = (3p_2 + 2p_1)/2p_1$, $x_2^A = (4p_1 - p_2)/2p_2$, $x_1^B = (p_1 + 6p_2)/2p_1$,
$x_2^B = p_1/2p_2$.

(b) $(p_1, p_2) = (9, 5)$ (or any positive multiple thereof); the equilibrium allocation is
((33/18, 31/10), (39/18, 9/10)).

(c) The (non-boundary part of the) contract curve is given by the equation $x_2^A = (17 x_1^A + 5)/(2 x_1^A + 8)$.
The core is the part of this contract curve such that
$\ln(x_1^A + 1) + \ln(x_2^A + 2) \ge \ln 4 + \ln 3 = \ln 12$ (individual rationality constraint
for A) and $3 \ln(5 - x_1^A) + \ln(5 - x_2^A) \ge 3 \ln 2 + \ln 4 = \ln 32$ (individual rationality
constraint for B).

(d) The point $x^A = (33/18, 31/10)$ satisfies the equation $x_2^A = (17 x_1^A + 5)/(2 x_1^A + 8)$.

(e) For the disagreement point $d$ one can take the point $(\ln 12, \ln 32)$. The set $S$
contains all points $u \in \mathbb{R}^2$ that can be obtained as utilities from any distribution
of the goods that does not exceed total endowments $e = (4, 4)$. Unlike the
Walrasian equilibrium allocation, the allocation obtained by applying the Nash
bargaining solution is not independent of arbitrary monotonic transformations
of the utility functions. It is a 'cardinal' concept, in contrast to the Walrasian
allocation, which is 'ordinal'.

10.5  The  Matching  Problem  of  Table  10.1  Continued 

(a) The resulting matching is $(w_1, m_1)$, $(w_2, m_2)$; $w_3$ and $m_3$ remain single.

10.6  Another  Matching  Problem 

(a) With the men proposing: $(m_1, w_1)$, $(m_2, w_2)$, $(m_3, w_3)$. With the women proposing: $(m_1, w_1)$, $(m_2, w_3)$, $(m_3, w_2)$.

(b) Since in any stable matching we must have $(m_1, w_1)$, the matchings found in (a)
are  the  only  stable  ones. 

(c)  Obvious:  every  man  weakly  or  strongly  prefers  the  men  proposing  matching  in 
(a);  and  vice  versa  for  the  women. 

10.7  Yet  Another  Matching  Problem:  Strategic  Behavior 

(b)  There  are  no  other  stable  matchings. 

(c) The resulting matching is $(m_1, w_1)$, $(m_2, w_3)$, $(m_3, w_2)$. This is clearly better for $w_1$.

10.8  Core  Property  of  Top  Trading  Cycle  Procedure 

All  players  in  a  top  trading  cycle  get  their  top  houses,  and  thus  none  of  these  players 
can  be  a  member  of  a  blocking  coalition,  say  S.  Omitting  these  players  and  their 
houses  from  the  problem,  by  the  same  argument  none  of  the  players  in  a  top  trading 
cycle  in  the  second  round  can  be  a  member  of  S :  the  only  house  that  such  a  player 
may prefer is no longer available in $S$; etc.

10.9  House  Exchange  with  Identical  Preferences 

Without  loss  of  generality,  assume  that  each  player  has  the  same  preference 
$h_1 h_2 \ldots h_n$. Show that in a core allocation each player keeps his own house.

10.10  A  House  Exchange  Problem 

There  are  three  core  allocations  namely:  (i)  1  :  hi,  2  :  h*,  3  :  h\,  4  :  h2\  (ii)  1  : 

2  :  /14,  3  :  /ii,  4  :  /13;  (iii)  1  :  h^,  2  :  h\,  3  :  h\,  4  :  h2.  Allocation  (i)  is  in  the  strong 


core. 
10.11 Cooperative Oligopoly

(a)-(c)  Analogous  to  Problems  6.1,  6.2.  Parts  (d)  and  (f)  follow  directly  from  (c). 
For  parts  (e)  and  (g)  use  the  methods  of  Chap.  9. 


Problems of Chapter 11

11.1  Preferences 

(a) If $a \neq b$ and $aRb$ and $bRa$ then neither $aPb$ nor $bPa$, so $P$ is not necessarily
complete. 

(b) $I$ is not complete unless $aRb$ for all $a, b \in A$. $I$ is only antisymmetric if $R$ is a
linear  order. 

11.2  Pairwise  Comparison 

(a)  C(r)  is  reflexive  and  complete  but  not  antisymmetric. 

(c)  There  is  no  Condorcet  winner  in  this  example. 

11.3  Independence  of  the  Conditions  in  Theorem  11.1 

The  social  welfare  function  based  on  the  Borda  scores  is  Pareto  efficient  but  does 
not satisfy IIA and is not dictatorial (cf. Sect. 11.1). The social welfare function that
assigns to each profile of preferences the reverse preference of agent 1 satisfies IIA
and is not dictatorial but also not Pareto efficient.

11.4  Independence  of  the  Conditions  in  Theorem  11.2 

A  constant  social  welfare  function  (i.e.,  always  assigning  the  same  fixed  alternative) 
is  strategy-proof  and  nondictatorial  but  not  surjective.  The  social  welfare  function 
that  always  assigns  the  bottom  element  of  agent  1  is  surjective,  nondictatorial,  and 
not  strategy-proof. 

11.5  Independence  of  the  Conditions  in  Theorem  11.3 

A  constant  social  welfare  function  (i.e.,  always  assigning  the  same  fixed  alternative) 
is  monotonic  and  nondictatorial  but  not  unanimous.  A  social  welfare  function  that 
assigns  the  common  top  alternative  to  any  profile  where  all  agents  have  the  same 
top  alternative,  and  a  fixed  constant  alternative  to  any  other  profile,  is  unanimous 
and  nondictatorial  but  not  monotonic. 

11.6  Copeland  Score  and  Kramer  Score 

(a) The Copeland ranking is a preference. The Copeland ranking is not antisymmetric.
It is easy to see that the Copeland ranking is Pareto efficient. By Arrow’s
Theorem  therefore,  it  does  not  satisfy  IIA. 

(b)  The  Kramer  ranking  is  a  preference.  The  Kramer  ranking  is  not  antisymmetric 
and  not  Pareto  efficient.  It  violates  IIA. 
11.7 Two Alternatives

Consider the social welfare function based on majority rule, i.e., it assigns $aPb$ if
$|N(a,b,r)| > |N(b,a,r)|$; $bPa$ if $|N(a,b,r)| < |N(b,a,r)|$; and $aIb$ if $|N(a,b,r)| =
|N(b,a,r)|$.
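Majority rule for two alternatives is easy to state in code: count the agents who rank $a$ above $b$, count those who rank $b$ above $a$, and compare. The following is a minimal sketch under the assumption that every agent has a strict ranking; the function name `majority_rule` and the encoding of a profile as a list of tuples are illustrative choices, not notation from the text.

```python
def majority_rule(profile, a="a", b="b"):
    """Social preference between a and b under majority rule.

    `profile` is a list of strict individual rankings, each given as a
    sequence with the more preferred alternative first.
    """
    n_ab = sum(1 for ranking in profile if ranking.index(a) < ranking.index(b))
    n_ba = len(profile) - n_ab  # strict rankings: everyone else prefers b to a
    if n_ab > n_ba:
        return "aPb"
    if n_ab < n_ba:
        return "bPa"
    return "aIb"


three_agents = [("a", "b"), ("a", "b"), ("b", "a")]
two_agents = [("a", "b"), ("b", "a")]
```

With an even number of agents a tie is possible, which is why the rule needs the indifference outcome $aIb$.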


Problems of Chapter 12

12.1  Solving  a  Matrix  Game 

(c) $v(A) = 12/5$ and the unique optimal strategies of players 1 and 2 are,
respectively,  (0,  4/5, 1/5, 0)  and  (0,  2/5, 0,  3/5). 

(d)  The  answer  is  independent  of  y  and  the  same  as  in  (c). 

12.3  2x2  Games 

(a) To have no saddlepoints we need $a_{11} > a_{12}$ or $a_{11} < a_{12}$. Assuming the first,
the other inequalities follow.

(b)  For  optimal  strategies  p  =  (p,  1  —p)  and  q  =  (q,  1  —  q)  we  must  have  0  <  p  <  1 
and  0  <  q  <  1.  Then  use  that  p  should  be  such  that  player  2  is  indifferent 
between  the  two  columns  and  q  such  that  player  1  is  indifferent  between  the  two 
rows. 
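The indifference argument in (b) leads to closed-form expressions. Writing $D = a_{11} - a_{12} - a_{21} + a_{22}$, making player 2 indifferent between the columns gives $p = (a_{22} - a_{21})/D$, making player 1 indifferent between the rows gives $q = (a_{22} - a_{12})/D$, and the value is $(a_{11}a_{22} - a_{12}a_{21})/D$. The sketch below computes these for integer entries; the function name `solve_2x2` and the example matrix are illustrative assumptions, and the formulas are only meant for games without saddlepoints.

```python
from fractions import Fraction as F


def solve_2x2(a11, a12, a21, a22):
    """Optimal strategies and value of a 2x2 matrix game without saddlepoints."""
    d = a11 - a12 - a21 + a22
    p = F(a22 - a21, d)   # row 1 probability: player 2 indifferent between columns
    q = F(a22 - a12, d)   # column 1 probability: player 1 indifferent between rows
    value = F(a11 * a22 - a12 * a21, d)
    return (p, 1 - p), (q, 1 - q), value


# Hypothetical example with a11 > a12, a12 < a22, a22 > a21, a21 < a11.
p_opt, q_opt, v = solve_2x2(2, -1, -1, 1)

# Against p_opt both columns yield the same expected payoff, namely the value.
column_payoffs = (p_opt[0] * 2 + p_opt[1] * -1, p_opt[0] * -1 + p_opt[1] * 1)
```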

12.4  Symmetric  Games 

Let $x$ be optimal for player 1. Then $xAy \ge v(A)$ for all $y$; hence $yAx = -xAy \le
-v(A)$ for all $y$; hence (take $y = x$) $v(A) \le -v(A)$, so $v(A) \le 0$. Similarly, derive
the  converse  inequality  by  considering  an  optimal  strategy  for  player  2. 

12.5  The  Duality  Theorem  Implies  the  Minimax  Theorem 

Let  A  be  an  m  x  n  matrix  game.  Without  loss  of  generality  assume  that  all  entries  of 
A  are  positive.  Consider  the  associated  LP  as  in  Sect.  12.2. 

Consider the vector $x = (1/m, \ldots, 1/m, \eta) \in \mathbb{R}^{m+1}$ with $\eta > 0$. Since all entries
of $A$ are positive it is straightforward to check that $x \in V$ if $\eta \le \sum_{i=1}^m a_{ij}/m$ for
all $j = 1, \ldots, n$. Since $x \cdot c = -\eta < 0$, it follows that the value of the LP must be
negative.

Let $x \in O_{\min}$ and $y \in O_{\max}$ be optimal solutions of the LP. Then $-x_{m+1} = -y_{n+1} < 0$ is the value of the LP. We have $x_i \ge 0$ for every $i = 1, \ldots, m$,
$\sum_{i=1}^m x_i \le 1$, and $(x_1, \ldots, x_m)Ae^j \ge x_{m+1}$ $(> 0)$ for every $j = 1, \ldots, n$. Optimality
in particular implies $\sum_{i=1}^m x_i = 1$, so that $v_1(A) \ge (x_1, \ldots, x_m)Ae^j \ge x_{m+1}$ for all
$j$, hence $v_1(A) \ge x_{m+1}$. Similarly, it follows that $v_2(A) \le y_{n+1} = x_{m+1}$, so that
$v_2(A) \le v_1(A)$. The Minimax Theorem now follows.

12.6  Infinite  Matrix  Games 

(a) $A$ is an infinite matrix game with for all $i,j \in \mathbb{N}$: $a_{ij} = 1$ if $i > j$, $a_{ij} = 0$ if $i = j$,
and $a_{ij} = -1$ if $i < j$.

(b) Fix a mixed strategy $p = (p_1, p_2, \ldots)$ for player 1 with $p_i \ge 0$ for all $i \in \mathbb{N}$
and $\sum_{i=1}^{\infty} p_i = 1$. If player 2 plays pure strategy $j$, then the expected payoff
for player 1 is equal to $-\sum_{i=1}^{j-1} p_i + \sum_{i=j+1}^{\infty} p_i$. Since $\sum_{i=1}^{\infty} p_i = 1$, this
expected payoff converges to $-1$ as $j$ approaches $\infty$. Hence, $\inf_q pAq = -1$,
so $\sup_p \inf_q pAq = -1$. Similarly, one shows $\inf_q \sup_p pAq = 1$, hence the
game has no ‘value’.
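To see the convergence in (b) concretely, take the particular mixed strategy $p_i = 2^{-i}$ (an assumption made only for this illustration). Against column $j$ the expected payoff $-\sum_{i<j} p_i + \sum_{i>j} p_i$ then equals $-1 + 3 \cdot 2^{-j}$, which tends to $-1$. The sketch below computes the payoffs exactly; the function names are illustrative.

```python
from fractions import Fraction as F


def p(i):
    """Assumed mixed strategy of player 1: p_i = 2^(-i) for i = 1, 2, ..."""
    return F(1, 2 ** i)


def payoff_against_column(j):
    """Expected payoff to player 1 when player 2 plays pure strategy j."""
    head = sum((p(i) for i in range(1, j)), F(0))  # rows i < j pay -1
    tail = 1 - head - p(j)                         # rows i > j pay +1
    return -head + tail


payoffs = [payoff_against_column(j) for j in range(1, 21)]
```

Whatever mixed strategy player 1 fixes, player 2 can push the payoff arbitrarily close to $-1$ by choosing a large enough column, which is the content of $\inf_q pAq = -1$.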

12.7  Equalizer  Theorem 

Assume, without loss of generality, $v = 0$. It is sufficient to show that there exists
$q \in \mathbb{R}^n$ with $q \ge 0$, $Aq \le 0$, and $q_n = 1$. The required optimal strategy is then
obtained by normalization.

This is equivalent to existence of a vector $(q, w) \in \mathbb{R}^{n+m}$ with $q \ge 0$, $w \ge 0$,
such that

$$\begin{pmatrix} A & I \\ e^n & 0 \end{pmatrix} \begin{pmatrix} q \\ w \end{pmatrix} = \begin{pmatrix} 0 \\ 1 \end{pmatrix},$$

where row vector $e^n \in \mathbb{R}^n$, $I$ is the $m \times m$ identity matrix, 0 is a $1 \times m$ vector on
the left hand side and an $m \times 1$ vector on the right hand side. Thus, we have to show
that the vector $x := (0, 1) \in \mathbb{R}^{m+1}$ is in the cone spanned by the columns of the
$(m + 1) \times (n + m)$ matrix on the left hand side. Call this matrix $B$ and call this cone
$Z$. Assume $x \notin Z$ and derive a contradiction using Theorem 22.1.


Problems of Chapter 13

13.1  Existence  of  Nash  Equilibrium  Using  Brouwer 

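(c) Let $\sigma^* \in \prod_{i \in N} \Delta(S_i)$. If $\sigma^*$ is a Nash equilibrium of $G$ then

$$\sigma_i^*(s_i) = \frac{\sigma_i^*(s_i) + \max\{0, u_i(s_i, \sigma_{-i}^*) - u_i(\sigma^*)\}}{1 + \sum_{s_i' \in S_i} \max\{0, u_i(s_i', \sigma_{-i}^*) - u_i(\sigma^*)\}} \qquad (*)$$

for all $i \in N$ and $s_i \in S_i$, so that $\sigma^*$ is a fixed point of $f$. Conversely, let $\sigma^*$ be a
fixed point of $f$. Then $(*)$ holds for all $i \in N$ and $s_i \in S_i$. Hence

$$\sigma_i^*(s_i) \sum_{s_i' \in S_i} \max\{0, u_i(s_i', \sigma_{-i}^*) - u_i(\sigma^*)\} = \max\{0, u_i(s_i, \sigma_{-i}^*) - u_i(\sigma^*)\}.$$

Multiply both sides of this equation by $u_i(s_i, \sigma_{-i}^*) - u_i(\sigma^*)$ and next sum over
all $s_i \in S_i$.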
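The map $f$ used in this argument adds to each pure strategy's probability the (nonnegative part of the) gain from deviating to it and then renormalizes. As an illustration, the sketch below implements this map for a two-player game in bimatrix form and applies it to Matching Pennies, which is an assumed example and not a game from the text; the function names `expected` and `nash_map` are illustrative as well.

```python
from fractions import Fraction as F


def expected(M, p, q):
    """Expected payoff p M q for mixed strategies p (rows) and q (columns)."""
    return sum(p[i] * M[i][j] * q[j] for i in range(len(p)) for j in range(len(q)))


def nash_map(A, B, p, q):
    """One application of the map f to the strategy pair (p, q) in the game (A, B)."""
    m, n = len(p), len(q)
    u1, u2 = expected(A, p, q), expected(B, p, q)
    # Nonnegative gains from deviating to each pure strategy.
    gain1 = [max(0, sum(A[i][j] * q[j] for j in range(n)) - u1) for i in range(m)]
    gain2 = [max(0, sum(p[i] * B[i][j] for i in range(m)) - u2) for j in range(n)]
    new_p = [(p[i] + gain1[i]) / (1 + sum(gain1)) for i in range(m)]
    new_q = [(q[j] + gain2[j]) / (1 + sum(gain2)) for j in range(n)]
    return new_p, new_q


# Assumed example: Matching Pennies.
A = [[1, -1], [-1, 1]]
B = [[-1, 1], [1, -1]]
half = [F(1, 2), F(1, 2)]
pure = [F(1), F(0)]

at_equilibrium = nash_map(A, B, half, half)   # the equilibrium is left unchanged
off_equilibrium = nash_map(A, B, pure, pure)  # player 2 shifts toward column 2
```

At the mixed equilibrium all gains are zero, so nothing moves; at the pure pair player 2 gains 2 by switching columns, and the map moves her strategy to (1/3, 2/3).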
13.2 Existence of Nash Equilibrium Using Kakutani

For upper semi-continuity of $\beta$, take a sequence $\sigma^k$ converging to $\sigma$, a sequence
$\tau^k \in \beta(\sigma^k)$ converging to $\tau$, and show $\tau \in \beta(\sigma)$.

13.3  Lemma  13.2 

The  only-if  direction  is  straightforward  from  the  definition  of  best  reply. 

13.4  Lemma  13.3 

Take $i$ such that $e^i Aq \ge e^k Aq$ for all $k = 1, \ldots, m$. Then, clearly, $e^i Aq \ge p'Aq$ for
all $p' \in \Delta^m$, so $e^i \in \beta_1(q)$. The second part is analogous.

13.5  Dominated  Strategies 

(b)  Denote  by  NE(A,  B )  the  set  of  Nash  equilibria  of  (A,  B).  Then 

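$$\begin{aligned}
(p^*, q^*) \in NE(A, B) &\Leftrightarrow (p^*, (q', 0)) \in NE(A, B) \text{ where } (q', 0) = q^* \\
&\Leftrightarrow \forall p \in \Delta^m, q \in \Delta^{n-1}\ [p^*A(q',0) \ge pA(q',0),\ p^*B(q',0) \ge p^*B(q,0)] \\
&\Leftrightarrow \forall p \in \Delta^m, q \in \Delta^{n-1}\ [p^*A'q' \ge pA'q',\ p^*B'q' \ge p^*B'q] \\
&\Leftrightarrow (p^*, q') \in NE(A', B').
\end{aligned}$$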

Note  that  the  first  equivalence  follows  by  part  (a). 

13.6  A  3  x  3  Bimatrix  Game 


(c)  The  unique  Nash  equilibrium  is  ((0, 0,  1),  (0,  0, 1)). 


13.7  A  3  x  2  Bimatrix  Game 

The  set  of  Nash  equilibria  is  Kp.q)  g  A3  x  A2 

{((1,0,0),  (0,1))}. 


Pi  =  o,  qi  > 


13.8  The  Nash  Equilibria  in  Example  13.18 


(a) Let $p = (p_1, p_2, p_3)$ be the strategy of player 1. We distinguish two cases: (i)
$p_2 = 0$ (ii) $p_2 > 0$.

In case (i), reduce the game to

          q1     q2     q3
    p1    1,1    0,0    2,0
    p3    0,0    1,1    1,1

where $q = (q_1, q_2, q_3)$ is player 2’s strategy. Solve this game graphically. As
long as player 1 gets at least 1 (the payoff from playing M) the obtained equilibria
are also equilibria of the original game $G$.

In case (ii), R gives a lower expected payoff to player 2 than C, so the game
can be reduced to

          q1     q2
    p1    1,1    0,0
    p2    1,2    1,2
    p3    0,0    1,1

Solve this game graphically and extend to $G$.

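(b) Consider again the perturbed games $G(\varepsilon)$ as in Example 13.18. For $q = 0$
consider the strategy combination $((\varepsilon, 1 - 2\varepsilon, \varepsilon), (\varepsilon, 1 - 2\varepsilon, \varepsilon))$ in $G(\varepsilon)$. For
$q = 1$ consider, similarly, $((\varepsilon, 1 - 2\varepsilon, \varepsilon), (1 - 2\varepsilon, \varepsilon, \varepsilon))$ in $G(\varepsilon)$; for $0 < q < 1$
consider $((\varepsilon, 1 - 2\varepsilon, \varepsilon), (q - \varepsilon/2, 1 - q - \varepsilon/2, \varepsilon))$.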

13.9  Proof  of  Theorem  13.8 

‘If’: conditions (13.1) are satisfied and $f = 0$, which is optimal since $f \le 0$ always.

‘Only-if’: clearly we must have $a = pAq$ and $b = pBq$ (otherwise $f < 0$ which
cannot be optimal). From the conditions (13.1) we have $p'Aq \le a = pAq$ and
$pBq' \le b = pBq$ for all $p' \in \Delta^m$ and $q' \in \Delta^n$, which implies that $(p, q)$ is a Nash
equilibrium. 

13.10  Matrix  Games 

This  is  a  repetition  of  the  proof  of  Theorem  12.5.  Note  that  the  solutions  of  program 
(13.3)  give  exactly  the  value  of  the  game  a  and  the  optimal  (minimax)  strategies  of 
player  2.  The  solutions  of  program  (13.4)  give  exactly  the  value  of  the  game  —b  and 
the  optimal  (maximin)  strategies  of  player  1 . 

13.11  Tic-Tac-Toe 

(a)  Start  by  putting  a  cross  in  the  center  square.  Then  player  2  has  essentially  two 
possibilities  for  the  second  move,  and  it  is  easy  to  see  that  in  each  of  the  two 
cases  player  1  has  a  forcing  third  move.  After  this,  it  is  equally  easy  to  see  that 
player  1  can  always  enforce  a  draw. 

(b)  If  player  1  does  not  start  at  the  center,  then  player  2  can  put  his  first  circle  at 
the  center  and  then  can  place  his  second  circle  in  such  a  way  that  it  becomes 
forcing.  If  player  1  starts  at  the  center  then  either  a  pattern  as  in  (a)  is  followed, 
leading  to  a  draw,  or  player  2’s  second  circle  becomes  forcing,  also  resulting  in 
a  draw. 

13.12  Iterated  Elimination  in  a  Three-Player  Game 
The resulting strategy combination is $(D, l, L)$.

13.13 Never a Best Reply and Domination

First  argue  that  strategy  Y  is  not  strictly  dominated.  Next  assume  that  Y  is  a  best 
reply  to  strategies  (p,  1  —  p)  of  player  1  and  (q,  1  —  q)  of  player  2,  and  derive  a 
contradiction. 

13.15 A 3-Player Game with an Undominated But Not Perfect Equilibrium

(a) First observe that the set of Nash equilibria is $\{((p, 1 - p), l, L) \mid 0 \le p \le 1\}$,
where  p  is  the  probability  with  which  player  1  plays  U. 

13.16  Existence  of  Proper  Equilibrium 
Tedious  but  straightforward. 

13.17  Strictly  Dominated  Strategies  and  Proper  Equilibrium 

(a) The only Nash equilibria are $(U, l(, L))$ and $(D, r(, L))$. Obviously, only the first
one is perfect and proper.

(b) $(D, r, L)$ is a proper Nash equilibrium.

13.18  Strictly  Perfect  Equilibrium 

(a)  Identical  to  the  proof  of  Lemma  13.16,  see  Problem  13.14:  note  that  any 
sequence  of  perturbed  games  converging  to  the  given  game  must  eventually 
contain any given completely mixed Nash equilibrium $\sigma$.

(c) The set of Nash equilibria is $\{((p, 1 - p), L) \mid 0 \le p \le 1\}$, where $p$ is the
probability  on  U.  Every  Nash  equilibrium  of  the  game  (A,  B)  is  perfect  and 
proper.  No  Nash  equilibrium  is  strictly  perfect. 

13.19  Correlated  Equilibria  in  the  Two-Driver  Example  (1) 

Use inequalities (13.5) and (13.6) to derive the conditions: $p_{11} + p_{12} + p_{21} + p_{22} = 1$,
$p_{ij} \ge 0$ for all $i,j \in \{1, 2\}$, $p_{11} \le \tfrac{3}{5}\min\{p_{12}, p_{21}\}$, $p_{22} \le \tfrac{5}{3}\min\{p_{12}, p_{21}\}$.

13.20  Nash  Equilibria  are  Correlated 
Check  that  (13.5)  and  (13.6)  are  satisfied  for  P. 

13.21  The  Set  of  Correlated  Equilibria  is  Convex 

Let  P  and  Q  be  correlated  equilibria  and  0  <  t  <  1.  Check  that  (13.5)  and  (13.6)  are 
satisfied  for  tP  +  (1  —  t)Q. 

13.22  Correlated  vs.  Nash  Equilibrium 

(a)  The  Nash  equilibria  are:  ((1, 0),  (0, 1)),  ((0, 1),  (1, 0)),  and  ((2/3, 1/3), 
(2/3, 1/3)). 
13.23 Correlated Equilibria in the Two-Driver Example (2)

The matrix $C$ is:

                (1,2)    (2,1)    (1',2')    (2',1')
    (1,1')       -10       0       -10         0
    (1,2')         6       0         0        10
    (2,1')         0      10         6         0
    (2,2')         0      -6         0        -6

13.24  Finding  Correlated  Equilibria 
There  is  a  unique  correlated  equilibrium 


13.25  Nash,  Perfect,  Proper,  Strictly  Perfect,  and  Correlated  Equilibria 


(d) $\{((0,1,0), (q_1,q_2,q_3)) \mid q_3 \le 2q_1 \le 4q_3\} \cup \{((p_1,p_2,p_3), (0,1,0)) \mid 0 < p_1 \le 2p_3 \le 4p_1\}$.

(e) The only perfect equilibrium is ((0, 1, 0), (0, 1, 0)).

(f) ((0, 1, 0), (0, 1, 0)) is also the only proper and strictly perfect equilibrium.

(h) For instance $\beta = \gamma = 3/8$, $\alpha = \delta = 1/8$.


13.26  Independence  of  the  Axioms  in  Corollary  13.40 

Not  OPR:  take  the  set  of  all  strategy  combinations  in  every  game.  Not  CONS:  in 
games  with  maximal  player  set  take  all  strategy  combinations,  in  other  games  take 
the  set  of  Nash  equilibria.  Not  COCONS:  drop  a  Nash  equilibrium  in  some  game 
with  maximal  player  set,  but  otherwise  always  take  the  set  of  all  Nash  equilibria. 

13.27  Inconsistency  of  Perfect  Equilibria 

First show that the perfect equilibria in $G_0$ are all strategy combinations where
player  2  plays  L,  player  3  plays  D ,  and  player  1  plays  any  mixture  between  T  and  B. 
Next  consider  the  reduced  game  by  fixing  player  3’s  strategy  at  D. 


Problems of Chapter 14

14.2  An  Extensive  Form  Structure  without  Perfect  Recall 

(a) The paths $\{(x_0, x_1)\}$ and $\{(x_0, x_2)\}$ contain different player 1 actions.

14.3  Consistency  Implies  Bayesian  Consistency 

With notations as in Definition 14.13, for $h \in H$ with $P_b(h) > 0$ and $x \in h$ we
have: $\beta_h(x) = \lim_{m \to \infty} \beta_h^m(x) = \lim_{m \to \infty} P_{b^m}(x)/P_{b^m}(h) = P_b(x)/P_b(h)$. Here,
the second equality follows from Bayesian consistency of the $(b^m, \beta^m)$.
14.4 (Bayesian) Consistency in Signaling Games

The idea of the proof is as follows. Let $(b, \beta)$ be a Bayesian consistent assessment.
This means that $\beta$ is determined on every information set of player 2 that is reached
with positive probability, given $b_1$. Take $m \in \mathbb{N}$. Assign the number $1/m^2$ to action
$a$ of a type $i$ of player 1 if that type does not play $a$ but some other type of player 1
plays $a$ with positive probability. Assign the number $1/m^2$ also to action $a$ of type $i$
if no type of player 1 plays $a$ and player 2 attaches zero belief probability to type $i$
conditional on player 1 having played $a$. To every other action $a$ of player 1, assign
the number $\beta(i, a)/m$, where $\beta(i, a)$ is the (positive) belief that player 2 attaches to
player 1 being of type $i$ conditional on having played $a$. Next, normalize all these
numbers to behavioral strategies $b_1^m$ of player 1. For player 2, just take completely
mixed behavioral strategies $b_2^m$ converging to $b_2$. Then $(b^m, \beta^m) \to (b, \beta)$, where
the $\beta^m$ are determined by Bayesian consistency.

14.5  Sequential  Equilibria  in  a  Signaling  Game 

There  is  one  pure  and  one  completely  mixed  sequential  equilibrium. 

14.6  Computation  of  Sequential  Equilibrium  (1) 

The  unique  sequential  equilibrium  consists  of  the  behavioral  strategies  where  player 
1  plays  B  with  probability  1  and  C  with  probability  1/2,  and  player  2  plays  L  with 
probability 1/2; and player 1 believes that $x_3$ and $x_4$ are equally likely.

14.7  Computation  of  Sequential  Equilibrium  (2) 

(b) The Nash equilibria are $(L, l)$, and $(R, (\alpha, 1 - \alpha))$ for all $\alpha \le 1/2$, where $\alpha$ is
the probability with which player 2 plays $l$.

(c) Let $\pi$ be the belief player 2 attaches to node $y_1$. Then the sequential equilibria
are: $(L, l)$ with belief $\pi = 1$; $(R, r)$ with belief $\pi \le 1/2$; and $(R, (\alpha, 1 - \alpha))$ for
any $\alpha \le 1/2$ with belief $\pi = 1/2$.

14.8  Computation  of  Sequential  Equilibrium  (3) 

(b) The Nash equilibria are $(R, (q, 1 - q))$ with $1/3 \le q \le 2/3$. (The conditions on
q  keep  player  1  from  deviating  to  L  or  M.) 

14.9  Computation  of  Sequential  Equilibrium  (4) 

The Nash equilibria in this game are: $(R, (q_1, q_2, q_3))$ with $q_3 \le 1/3$ and $q_1 \le
1/2 - (3/4)q_3$, where $q_1, q_2, q_3$ are the probabilities put on $l, m, r$, respectively; and
((1/4, 3/4, 0), (1/4, 0, 3/4)) (probabilities on $L, M, R$ and $l, m, r$, respectively).

Let $\pi$ be the belief attached by player 2 to $y_1$. Then with $\pi = 1/4$ the equilibrium
((1/4, 3/4, 0), (1/4, 0, 3/4)) becomes sequential. The first set of equilibria contains
no  equilibrium  that  can  be  extended  to  a  sequential  equilibrium. 

14.10  Computation  of  Sequential  Equilibrium  (5) 

The Nash equilibria are: $(DB, r)$; $((R, (s, 1 - s)), (q, 1 - q))$ with $0 \le s \le 1$ and
$q \ge 1/3$, where $s$ is the probability on $A$ and $q$ is the probability on $l$. The subgame
perfect equilibria are: $(DB, r)$; $(RA, l)$; $((R, (3/4, 1/4)), (3/5, 2/5))$. The first one
becomes sequential with $\beta = 0$; the second one with $\beta = 1$; and the third one with
$\beta = 3/5$.


Problems of Chapter 15

15.1  Computing  ESS  in  2x2  Games  (1) 

ESS  (A)  can  be  computed  using  Proposition  15.4. 

(a) $ESS(A) = \{e^2\}$. (b) $ESS(A) = \{e^1, e^2\}$. (c) $ESS(A) = \{(2/3, 1/3)\}$.

15.2 Computing ESS in 2x2 Games (2)

Case (1): $ESS(A') = \{e^2\}$; case (2): $ESS(A') = \{e^1, e^2\}$; case (3): $ESS(A') = \{\hat{x}\} =
\{(a_2/(a_1 + a_2), a_1/(a_1 + a_2))\}$.

15.3  Rock-Paper-Scissors  (1) 

The unique Nash equilibrium is ((1/3, 1/3, 1/3), (1/3, 1/3, 1/3)), which is symmetric.
But (1/3, 1/3, 1/3) is not an ESS.
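Why the uniform strategy fails to be an ESS can be checked directly from the two ESS conditions: a strategy $x$ resists a mutant $y \neq x$ if $u(x,x) > u(y,x)$, or if $u(x,x) = u(y,x)$ and $u(x,y) > u(y,y)$. The sketch below does this for a pure mutant, assuming the standard zero-sum Rock-Paper-Scissors payoffs (win 1, lose $-1$, tie 0); the matrix of the problem itself is not reproduced in this section, so this payoff matrix is an assumption made for illustration.

```python
from fractions import Fraction as F

# Assumed payoff matrix: standard zero-sum Rock-Paper-Scissors.
A = [[0, -1, 1],
     [1, 0, -1],
     [-1, 1, 0]]


def u(x, y):
    """Expected payoff of strategy x against strategy y."""
    return sum(x[i] * A[i][j] * y[j] for i in range(3) for j in range(3))


x = [F(1, 3)] * 3        # the symmetric equilibrium strategy
y = [F(1), F(0), F(0)]   # a pure mutant, here Rock

mutant_is_best_reply = u(y, x) == u(x, x)   # first-order condition holds with equality
second_order_holds = u(x, y) > u(y, y)      # strict inequality required for an ESS
resists_invasion = u(x, x) > u(y, x) or (mutant_is_best_reply and second_order_holds)
```

Under these assumed payoffs the mutant earns exactly as much against the population as the incumbent does, and the incumbent does not do strictly better against the mutant, so the second-order condition fails.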

15.4  Uniform  Invasion  Barriers 

Case (1), $e^2$: maximal uniform invasion barrier is 1.

Case (2), $e^1$: maximal uniform invasion barrier is $a_1/(a_1 + a_2)$.

Case (2), $e^2$: maximal uniform invasion barrier is $a_2/(a_1 + a_2)$.

Case (3), $\hat{x}$: maximal uniform invasion barrier is 1.

15.5  Replicator  Dynamics  in  Normalized  Game  (1) 

Straightforward  computation. 

15.6  Replicator  Dynamics  in  Normalized  Game  (2) 

The replicator dynamics can be written as $\dot{x} = [x(a_1 + a_2) - a_2]x(1 - x)$, where
$x = x_1$. So $x = 0$ and $x = 1$ are always stationary points. In case (1) the graph of
$\dot{x}$ on (0, 1) is below the horizontal axis. In case (2) there is another stationary point,
namely at $x = a_2/(a_1 + a_2)$; on $(0, a_2/(a_1 + a_2))$ the function $\dot{x}$ is negative, on
$(a_2/(a_1 + a_2), 1)$ it is positive. In case (3) the situation of case (2) is reversed: the
function $\dot{x}$ is positive on $(0, a_2/(a_1 + a_2))$ and negative on $(a_2/(a_1 + a_2), 1)$.
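The sign pattern of $\dot{x} = [x(a_1 + a_2) - a_2]x(1 - x)$ can be illustrated numerically. The sketch below integrates the equation with a simple forward-Euler scheme; the step size, the number of steps, and the parameter values ($a_1 = 1, a_2 = 2$ for case (2) and $a_1 = -1, a_2 = -2$ for case (3)) are assumptions chosen only for illustration, and in both cases the interior stationary point is $a_2/(a_1 + a_2) = 2/3$.

```python
def x_dot(x, a1, a2):
    """Right-hand side of the replicator equation for the share x of strategy 1."""
    return (x * (a1 + a2) - a2) * x * (1 - x)


def simulate(x0, a1, a2, dt=0.01, steps=20000):
    """Forward-Euler integration of the replicator equation starting at x0."""
    x = x0
    for _ in range(steps):
        x += dt * x_dot(x, a1, a2)
    return x


# Case (2): trajectories move away from 2/3, toward 0 or 1.
below = simulate(0.6, 1, 2)
above = simulate(0.7, 1, 2)

# Case (3): trajectories move toward 2/3 from either side.
from_left = simulate(0.2, -1, -2)
from_right = simulate(0.9, -1, -2)
```

The outcome matches the sign analysis: in case (2) the interior point separates the basins of $x = 0$ and $x = 1$, while in case (3) it attracts every interior starting point.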

15.7  Weakly  Dominated  Strategies  and  Replicator  Dynamics 

(b) The stationary points are $e^1$, $e^2$, $e^3$, and all points with $x_3 = 0$. Except $e^3$,
all stationary points are Lyapunov stable. None of these points is asymptotically
stable. Also, $e^3$ is strictly dominated (by $e^1$). [One can also derive $d(x_1/x_2)/dt =
x_1x_3/x_2 > 0$ at completely mixed strategies, i.e., at the interior of $\Delta^3$. Hence, the
share of subpopulation 1 grows faster than that of 2 but this difference goes to zero
if $x_3$ goes to zero ($e^2$ is weakly dominated by $e^1$).]

15.8  Stationary  Points  and  Nash  Equilibria  (1) 

(a) $NE(A) = \{(\alpha, \alpha, 1 - 2\alpha) \mid 0 \le \alpha \le 1/2\}$.

(b) By Proposition 15.18 and (a) it follows that $\{(\alpha, \alpha, 1 - 2\alpha) \mid 0 \le \alpha \le 1/2\} \cup
\{e^1, e^2, e^3\} \subseteq ST(A)$, and that possibly other stationary points must be boundary
points of $\Delta^3$. By considering the replicator dynamics it follows that there are no
additional stationary points. All stationary points except $e^1$ and $e^2$ are Lyapunov
stable, but no point is asymptotically stable.

15.9  Stationary  Points  and  Nash  Equilibria  (2) 

(a)  The  Nash  equilibrium  strategies  are:  (0, 1,0),  (1/2,0, 1/2),  (0,2/3, 1/3),  and 

(0,0,1). 

(b)  Use  Proposition  15.18.  This  implies  that  (1,0,0),  (0, 1,0),  (0,0,  1),  (1/2,0, 
1/2),  and  (0,  2/3,  1/3)  all  are  stationary  states.  Any  other  stationary  state  must 
be on the boundary of $\Delta^3$ and have exactly one zero coordinate. Using this it
can  be  shown  that  there  are  no  other  stationary  states. 

(c)  The  state  (0, 0, 1)  is  asymptotically  stable.  All  other  stationary  states  are  not 
Lyapunov  stable. 

15.10  Lyapunov  Stable  States  in  2x2  Games 

Case  (1):  e2;  case  (2):  e1  and  e2;  case  (3):  x.  (Cf.  Problem  15.6.) 

15.11  Nash  Equilibrium  and  Lyapunov  Stability 

$NE(A) = \{e^1\}$. If we start at a completely mixed strategy close to $e^1$, then first $x_3$
increases, and we can make the solution trajectory pass $e^3$ as closely as desired. This
shows that $e^1$ is not Lyapunov stable.

15.12  Rock-Paper-Scissors  (2) 

(e) Follows from (d). If $a > 0$ then any trajectory converges to the maximum point
of $x_1x_2x_3$, i.e. to (1/3, 1/3, 1/3). If $a = 0$ then the trajectories are orbits ($x_1x_2x_3$
constant) around (1/3, 1/3, 1/3). If $a < 0$ then the trajectories move outward,
away  from  (1/3, 1/3, 1/3). 


Problems of Chapter 16

16.1  Imputation  Set  of  an  Essential  Game 

Note that $I(v)$ is a convex set and $f^i \in I(v)$ for every $i = 1, \ldots, n$. Thus,
$I(v)$ contains the convex hull of $\{f^i \mid i \in N\}$. Now let $x \in I(v)$, and write
$x = (v(1), \ldots, v(n)) + (\alpha_1, \ldots, \alpha_n)$, where $\sum_{i \in N} \alpha_i = v(N) - \sum_{i \in N} v(i) =: \alpha$.

16.2  Convexity  of  the  Domination  Core 

First prove the following claim: For each $x \in I(v)$ and $\emptyset \neq S \subseteq N$ we have

$$\exists z \in I(v) : z\, \mathrm{dom}_S\, x \;\Leftrightarrow\; x(S) < v(S) \text{ and } x(S) < v(N) - \sum_{i \notin S} v(i).$$

Use  this  claim  to  show  that  I(v)  \  D(S )  is  a  convex  set.  Finally,  conclude  that  DC(v ) 
must  be  convex. 
16.3 Dominated Sets of Imputations

(b) In both games, $D(ij) = \{x \in I(v) \mid x_i + x_j < v(ij)\}$, $i,j \in \{1, 2, 3\}$, $i \neq j$.

16.7  A  Glove  Game 

(b) The core and the domination core are both equal to $\{(0, 1, 0)\}$, cf. Theorem 16.12.

16.11  Core  and  D-Core 

Condition (16.1) is not a necessary condition for equality of the core and the D-core.
To find a counterexample, first note that if $C(v) \neq \emptyset$ then (16.1) must hold.
Therefore, a counterexample has to be some game with empty core and D-core.

16.12  Strategic  Equivalence 
Straightforward  using  the  definitions. 

16.13  Proof  of  Theorem  16.20 
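Write $B = \begin{pmatrix} A \\ -A \end{pmatrix}$. Then

$$\begin{aligned}
\max\{b \cdot y \mid Ay = c,\ y \ge 0\} &= \max\{b \cdot y \mid By \le (c, -c),\ y \ge 0\} \\
&= \min\{(c, -c) \cdot (x, z) \mid (x, z)B \ge b,\ (x, z) \ge 0\} \\
&= \min\{c \cdot (x - z) \mid (x - z)A \ge b,\ (x, z) \ge 0\} \\
&= \min\{c \cdot x' \mid x'A \ge b\}.
\end{aligned}$$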

The  second  equality  follows  from  Theorem  22.6. 

16.14  Infeasible  Programs  in  Theorem  16.20 
Follow  the  hint. 

16.15  Proof  of  Theorem  16.22  Using  Lemma  22.5 
Follow  the  hint  and  investigate  (b)  of  Lemma  22.5. 

16.17  Minimum  of  Balanced  Games 

Follows  by  using  the  definition  of  balancedness  or  by  Theorem  16.22. 

16.18  Balanced  Simple  Games 
Let  (N,  v )  be  a  simple  game. 

Suppose $i$ is a veto player. Let $\mathcal{B}$ be a balanced collection with balanced map $\lambda$.
Then

$$\sum_{S \in \mathcal{B}} \lambda(S)v(S) = \sum_{S \in \mathcal{B} : i \in S} \lambda(S)v(S) \le 1 = v(N),$$

since $i$ is a veto player. Hence, $v$ is balanced.

For the converse, suppose $v$ is balanced, and distinguish two cases:

Case 1: There is an $i$ with $v(\{i\}) = 1$. Show that $i$ is a veto player.

Case 2: $v(\{i\}) = 0$ for every $i \in N$. Show that also in this case $v$ has veto players.


Problems of Chapter 17

17.1 The Games $1_T$

(c) For $i \in T$: $\Phi_i(1_T) = \frac{(|T|-1)!\,(n-|T|)!}{n!}$.
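The formula $\Phi_i(1_T) = (|T|-1)!\,(n-|T|)!/n!$ for $i \in T$ can be verified by brute force for a small case. The sketch below assumes that $1_T$ denotes the game with $1_T(T) = 1$ and $1_T(S) = 0$ for every $S \neq T$ (the definition is not repeated in this section), and computes the Shapley value as the average of the marginal vectors over all orderings of the players. The function name `shapley` and the choice $n = 4$, $T = \{0, 1\}$ are illustrative.

```python
from fractions import Fraction
from itertools import permutations
from math import factorial


def shapley(n, v):
    """Shapley value as the average marginal contribution over all orderings.

    `v` maps a frozenset of players (labelled 0, ..., n-1) to a worth.
    """
    phi = [Fraction(0)] * n
    for order in permutations(range(n)):
        coalition = frozenset()
        for i in order:
            phi[i] += Fraction(v(coalition | {i}) - v(coalition))
            coalition = coalition | {i}
    return [x / factorial(n) for x in phi]


n, T = 4, frozenset({0, 1})


def one_T(S):
    """Assumed definition of the game 1_T: worth 1 for S = T, 0 otherwise."""
    return 1 if S == T else 0


phi = shapley(n, one_T)
predicted = Fraction(factorial(len(T) - 1) * factorial(n - len(T)), factorial(n))
```

A player in $T$ has a nonzero marginal contribution only when joining $T \setminus \{i\}$, which is where the single factorial weight in the formula comes from.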

17.2  Unanimity  Games 

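(a) Suppose $\sum_{T \neq \emptyset} \alpha_T u_T = 0$, where 0 means the zero-game, for some $\alpha_T \in \mathbb{R}$.
Show that all $\alpha_T$ are zero by induction, starting with one-person coalitions.

(b) Let $W \in 2^N$, then show

$$\sum_{T \neq \emptyset} c_T u_T(W) = v(W) + \sum_{S : S \subsetneq W} v(S) \sum_{T : S \subseteq T \subseteq W} (-1)^{|T|-|S|}.$$

It is sufficient to show that the second term of the last expression is equal to 0,
hence that $\sum_{T : S \subseteq T \subseteq W} (-1)^{|T|-|S|} = 0$.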

17.3  If-Part  of  Theorem  17.4 

EFF, NP and ADD are straightforward. SYM needs more attention. Let $i, j$ be
symmetric in $v$. Note that for $S \subseteq N$ with $i \notin S$ and $j \in S$ we have $v((S \cup i) \setminus j) = v(S)$
by symmetry of $i$ and $j$, since $v((S \cup i) \setminus j) = v((S \setminus j) \cup i)$ and $v(S) = v((S \setminus j) \cup j)$.
Use this to show $\Phi_i(v) = \Phi_j(v)$ by collecting terms in a clever way.

17.4  Dummy  Player  Property  and  Anonymity 

That  DUM  implies  NP  and  the  Shapley  value  satisfies  DUM  is  straightforward.  AN 
implies  SYM:  Let  i  and  j  be  symmetric  players,  and  let  the  value  f  satisfy  AN. 
Then consider the permutation $\sigma$ with $\sigma(i) = j$, $\sigma(j) = i$, and $\sigma(k) = k$ otherwise.

17.5  Shapley  Value,  Core,  and  Imputation  Set 

In  the  case  of  two  players  the  core  and  the  imputation  set  coincide.  If  the  core  is  not 
empty  then  the  Shapley  value  is  in  the  core,  cf.  Example  17.2.  In  general,  consider 
any game with $v(1) = 2$, $v(N) = 3$, and $v(S) = 0$ otherwise. Then the Shapley
value is not even in the imputation set as soon as $n \ge 3$.
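For $n = 3$ the claim can be checked by direct computation. The sketch below evaluates the Shapley value of the game with $v(1) = 2$, $v(N) = 3$ and $v(S) = 0$ otherwise, using the weights $|S|!(n-|S|-1)!/n!$ on the marginal contributions $v(S \cup i) - v(S)$. The function name `shapley_value` and the encoding of coalitions as frozensets are illustrative choices.

```python
from fractions import Fraction
from itertools import combinations
from math import factorial


def shapley_value(n, v):
    """Shapley value via the weighted sum of marginal contributions; players are 1..n."""
    players = range(1, n + 1)
    phi = []
    for i in players:
        others = [k for k in players if k != i]
        total = Fraction(0)
        for size in range(n):
            weight = Fraction(factorial(size) * factorial(n - size - 1), factorial(n))
            for S in combinations(others, size):
                S = frozenset(S)
                total += weight * (v(S | {i}) - v(S))
        phi.append(total)
    return phi


n = 3
N = frozenset(range(1, n + 1))


def v(S):
    """The game from the text: v(1) = 2, v(N) = 3, and 0 for every other coalition."""
    if S == frozenset({1}):
        return 2
    if S == N:
        return 3
    return 0


phi = shapley_value(n, v)
individually_rational_for_1 = phi[0] >= v(frozenset({1}))
```

Player 1 receives 5/3, which is less than his individual worth 2, so the Shapley value is efficient but not individually rational here and therefore lies outside the imputation set.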

17.6  Shapley  Value  as  a  Projection 

If $a$ is an additive game then $\Phi(a) = (a(1), a(2), \ldots, a(n))$. For a general game $v$
let $a_v$ be the additive game generated by $\Phi(v)$. Then $\Phi(a_v) = (a_v(1), \ldots, a_v(n)) = \Phi(v)$.

17.7  Shapley  Value  of  Dual  Game 

Follow  the  hint,  or  give  a  direct  proof  by  using  (17.4). 
17.8 Multilinear Extension

(b) Let $g$ be another multilinear extension of $v$ to $[0, 1]^n$, say $g(x) =
\sum_{T \subseteq N} b_T \left(\prod_{i \in T} x_i\right)$. Show $b_T = c_T$ for all $T$ by induction, starting with one-
player coalitions.

17.9  The  Beta-Integral  Formula 
Apply  partial  integration. 

17.10 Path Independence of $\Phi$
Use  Theorem  17.12(c). 

17.11  An  Alternative  Characterization  of  the  Shapley  Value 

The Shapley value satisfies all these conditions. Conversely, (b)-(d) imply standardness
for two-person games, so the result follows from Theorem 17.18.


Problems of Chapter 18

18.1 Marginal Vectors and Dividends

(b) For each $i \in N$, $m_i^\pi = \sum_{T \subseteq P_\pi(i) \cup i,\ i \in T} \Delta_v(T)$.

18.2  Convexity  and  Marginal  Vectors 
Use  Theorems  18.3  and  18.6. 


18.3  Strictly  Convex  Games 

Let $\pi$ and $\sigma$ be two different permutations and suppose that $k \ge 1$ is the minimal
number such that $\pi(k) \neq \sigma(k)$. Then show that $m^\pi_{\pi(k)}(v) < m^\sigma_{\pi(k)}(v)$. Hence,
$m^\pi \neq m^\sigma$.

18.4  Sharing  Profits 

(a) For the landlord: $\Phi_0(v) = \frac{1}{n+1} \left[ \sum_{s=0}^{n} f(s) \right]$.

(c) Extend f to a piecewise linear function on $[0, n]$. Then v is convex if and only if f is convex.
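The landlord's formula in (a) can be checked by brute force. The sketch below assumes the usual form of the landlord game, which is not restated in this hint: player 0 is the landlord, players 1, ..., n are workers, and v(S) = f(|S| - 1) if the landlord is in S and v(S) = 0 otherwise. The function name `landlord_shapley` and the sample values of f are illustrative only.

```python
from fractions import Fraction
from itertools import permutations


def landlord_shapley(f):
    """Shapley value of the landlord (player 0); f[s] is the output with s workers."""
    n = len(f) - 1
    players = range(n + 1)

    def v(S):
        # assumed game: a coalition produces only if it contains the landlord
        return f[len(S) - 1] if 0 in S else 0

    total = Fraction(0)
    orders = list(permutations(players))
    for order in orders:
        before = set(order[:order.index(0)])  # workers preceding the landlord
        total += v(before | {0}) - v(before)
    return total / len(orders)


f = [0, 1, 3, 4]  # hypothetical production values f(0), ..., f(3)
print(landlord_shapley(f), Fraction(sum(f), len(f)))
```

Both printed numbers agree: the landlord is equally likely to occupy each of the n + 1 positions in a random ordering, and with s workers before him his marginal contribution is f(s).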


18.5  Sharing  Costs 


(a) For every nonempty coalition S, $v(S) = \sum_{i \in S} c_i - \max\{c_i \mid i \in S\}$. If we regard $c = (c_1, \ldots, c_n)$ as an additive game we can write $v = c - c_{\max}$, where $c_{\max}(S) = \max\{c_i \mid i \in S\}$.


18.6  Independence  of  the  Axioms  in  Theorem  18.8 


(a) Consider the value which, for every game v, gives each dummy player his individual worth and distributes the rest, $v(N) - \sum_{i \in D} v(i)$ where D is the subset of dummy players, evenly among the players. This value satisfies all axioms except LIN.

(b)  Consider  the  value  which,  for  every  game  v,  distributes  v(N)  evenly  among  all 
players.  This  value  satisfies  all  axioms  except  DUM. 

(c)  The  value  which  gives  each  player  his  individual  worth  satisfies  all  axioms 
except  EFF. 

(d) Consider any set of weights $\{\alpha_\pi \in \mathbb{R} \mid \pi \in \Pi(N)\}$ with $\sum_{\pi \in \Pi(N)} \alpha_\pi = 1$. The value $\sum_{\pi \in \Pi(N)} \alpha_\pi m^\pi$ satisfies LIN, DUM and EFF, but not MON unless all weights are nonnegative.


18.7 Null-Player in Theorem 18.8

Check that the dummy axiom in the proof of this theorem is only used for unanimity games. In those games, dummy players are also null-players, so it is sufficient to require NP. Alternatively, one can show that DUM is implied by ADD (and, thus, LIN), EFF and NP.

18.8  Characterization  of  Weighted  Shapley  Values 

Check that every weighted Shapley value satisfies the Partnership axiom. Conversely, let $\psi$ be a value satisfying the Partnership axiom and the four other axioms. Let $S^1 := \{i \in N \mid \psi_i(u_N) > 0\}$ and for every $i \in S^1$ let $\omega_i := \psi_i(u_N)$. Define, recursively, $S^k := \{i \in N \setminus (S^1 \cup \ldots \cup S^{k-1}) \mid \psi_i(u_{N \setminus (S^1 \cup \ldots \cup S^{k-1})}) > 0\}$ and for every $i \in S^k$ let $\omega_i := \psi_i(u_{N \setminus (S^1 \cup \ldots \cup S^{k-1})})$. This results in a partition $(S^1, \ldots, S^m)$ of N. Now define the weight system w by the partition $(S_1, \ldots, S_m)$ with $S_1 := S^m$, $S_2 := S^{m-1}$, ..., $S_m := S^1$, and the weights $\omega$. Then it is sufficient to prove that for each coalition S and each player $i \in S$ we have $\psi_i(u_S) = \Phi_i^w(u_S)$. Let $h := \max\{j \mid S \cap S_j \neq \emptyset\}$, then with $T = N \setminus (S_{h+1} \cup \ldots \cup S_m)$ we have by the Partnership axiom: $\psi_i(u_S) = \frac{1}{\psi(u_T)(S)} \psi_i(u_T)$. If $i \notin S_h$ then $\psi_i(u_T) = 0$, hence $\psi_i(u_S) = 0 = \Phi_i^w(u_S)$. If $i \in S_h$ then $\psi_i(u_S) = \frac{\omega_i}{\sum_{j \in S \cap S_h} \omega_j} = \Phi_i^w(u_S)$.

18.9  Core  and  Weighted  Shapley  Values  in  Example  18.2 
First  write  the  game  as  a  sum  of  unanimity  games: 

$v = u_{\{1,2\}} + u_{\{1,3\}} - u_{\{2,3\}} + 2u_N$.

Then consider all possible ordered partitions of N, there are 13 different ones, and associated weight vectors. This results in a description of all payoff vectors associated with weighted Shapley values, including those in the core of the game.
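The count of 13 ordered partitions of a three-player set can be confirmed by enumeration. This is a small sketch with a hypothetical helper `ordered_partitions`; it assigns each player a block index and keeps the assignments in which every block is nonempty.

```python
from itertools import product


def ordered_partitions(elements):
    """All ordered partitions (S_1, ..., S_m) of `elements` into nonempty blocks."""
    elements = list(elements)
    result = []
    for m in range(1, len(elements) + 1):
        # labels[k] is the index of the block that contains elements[k]
        for labels in product(range(m), repeat=len(elements)):
            if set(labels) == set(range(m)):  # every block is used
                blocks = tuple(
                    frozenset(e for e, lab in zip(elements, labels) if lab == j)
                    for j in range(m)
                )
                result.append(blocks)
    return result


parts = ordered_partitions([1, 2, 3])
print(len(parts))
for blocks in parts:
    print([sorted(b) for b in blocks])
```

There is 1 partition with one block, 6 with two blocks and 6 with three blocks.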


Problems of Chapter 19

19.1  Binary  Relations 

Not (4): $\succeq$ on $\mathbb{R}$ defined by $x \succeq y \Leftrightarrow x^2 \geq y^2$.

Not (3): $\geq$ on $\mathbb{R}^2$.


Not (2): $\succeq$ on $\mathbb{R}$ defined by: for all x, y with $x \geq y$, let $x \succeq y$ if $x - y \geq 1$, and let $y \succeq x$ if $x - y < 1$.

Not (1): $>$ on $\mathbb{R}$.

The ordering on $\mathbb{R}$, defined by $[x \succeq y] \Leftrightarrow [x = y \text{ or } 0 \leq x, y \leq 1]$ is reflexive and transitive but not complete and not anti-symmetric.
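The failures claimed in this answer can be exhibited on small finite samples. The sketch below is illustrative only: the sample points and the names `squares`, `componentwise`, `gap`, `strict` and `unit` are hypothetical, and the numbering (1) reflexive, (2) transitive, (3) complete, (4) anti-symmetric is inferred from the examples. A violation found on a sample proves that the property fails; the absence of a violation on a sample proves nothing.

```python
from itertools import product


def reflexive(rel, X):
    return all(rel(x, x) for x in X)


def transitive(rel, X):
    return all(rel(x, z) for x, y, z in product(X, repeat=3)
               if rel(x, y) and rel(y, z))


def complete(rel, X):
    return all(rel(x, y) or rel(y, x) for x, y in product(X, repeat=2))


def antisymmetric(rel, X):
    return all(x == y for x, y in product(X, repeat=2)
               if rel(x, y) and rel(y, x))


reals = [-1, 0, 0.5, 1, 2, 3]
plane = [(0, 0), (1, 0), (0, 1), (1, 1)]


def squares(x, y):          # example for "not (4)"
    return x * x >= y * y


def componentwise(x, y):    # example for "not (3)", on pairs
    return x[0] >= y[0] and x[1] >= y[1]


def gap(x, y):              # example for "not (2)"
    return (x >= y and x - y >= 1) or (y >= x and y - x < 1)


def strict(x, y):           # example for "not (1)"
    return x > y


def unit(x, y):             # the final example
    return x == y or (0 <= x <= 1 and 0 <= y <= 1)


print(antisymmetric(squares, reals))   # 1 and -1 are related both ways
print(complete(componentwise, plane))  # (1, 0) and (0, 1) are unrelated
print(transitive(gap, reals))          # 0 >= 0.5 >= 1 in the relation, but not 0 >= 1
print(reflexive(strict, reals))
print(complete(unit, reals), antisymmetric(unit, reals))
```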

19.2  Linear  Orders 

If $x \succ y$ then by definition $x \succeq y$ and not $y \succeq x$: hence $x \neq y$ since otherwise $y \succeq x$ by reflexivity.

If $x \succeq y$ and $x \neq y$ then not $y \succeq x$ since otherwise $x = y$ by anti-symmetry. Hence $x \succ y$.

19.3  The  Lexicographic  Order  (1) 

Check (1)-(4) in Sect. 19.2 for $\succeq_{\text{lex}}$. Straightforward.

19.4  The  Lexicographic  Order  (2) 

This is the set $\{(x_1, x_2) \in \mathbb{R}^2 \mid [x_1 = 3, x_2 \geq 1] \text{ or } [x_1 > 3]\}$. This set is not closed.

19.5  Representability  of  Lexicographic  Order  (1) 

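Consider Problem 19.4. Since $(\alpha, 0) \succeq_{\text{lex}} (3, 1)$ for all $\alpha > 3$, we have $u(\alpha, 0) \geq u(3, 1)$ for all $\alpha > 3$ and hence, by continuity of u, $u(3, 0) = \lim_{\alpha \downarrow 3} u(\alpha, 0) \geq u(3, 1)$. Therefore $(3, 0) \succeq_{\text{lex}} (3, 1)$, a contradiction.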

19.6  Representability  of  Lexicographic  Order  (2) 

Suppose that u represents $\succeq_{\text{lex}}$ on $\mathbb{R}^2$, that is, $x \succeq_{\text{lex}} y$ if and only if $u(x) \geq u(y)$ for all $x, y \in \mathbb{R}^2$. Then for any $t \in \mathbb{R}$ let $q(t)$ be a rational number in the interval $[u(t, 0), u(t, 1)]$. Since $(t, \alpha) \succ_{\text{lex}} (s, \beta)$ and hence $u(t, \alpha) > u(s, \beta)$ for all $t > s$ and all $\alpha, \beta \in [0, 1]$, we have $[u(t, 0), u(t, 1)] \cap [u(s, 0), u(s, 1)] = \emptyset$ for all $t \neq s$. Hence, $q(t) \neq q(s)$ for all $t \neq s$. Therefore, we have found uncountably many different rational numbers, a contradiction.

19.7 Single-Valuedness of the Pre-nucleolus

Consider the pre-nucleolus on a suitable compact convex subset and apply Theorem 19.3.

19.8  (Pre-)Nucleolus  and  Core 

Use  the  fact  that  core  elements  have  all  excesses  non-positive. 

19.9  Kohlberg  Criterion  for  the  Nucleolus 

First  observe  that  the  following  modification  of  Theorem  19.4  holds: 

Theorem 19.4′ Let $(N, v)$ be a game and $x \in I(N, v)$. Then the following two statements are equivalent.

(1) $x = \nu(N, v)$.

(2) For every $\alpha$ such that $\mathcal{D}(\alpha, x, v) \neq \emptyset$ and for every side-payment y with $y(S) \geq 0$ for every $S \in \mathcal{D}(\alpha, x, v)$ and with $y_i \geq 0$ for all $i \in N$ with $x_i = v(i)$ we have $y(S) = 0$ for every $S \in \mathcal{D}(\alpha, x, v)$.
The proof of this theorem is almost identical to the proof of Theorem 19.4. In the second sentence of the proof, note that $z_\varepsilon \in I(N, v)$ for $\varepsilon$ small enough. In the second part of the proof, (2) $\Rightarrow$ (1), note that $y_i = z_i - x_i \geq 0$ whenever $x_i = v(i)$.

For the ‘if’-part of the statement in this problem, let $x \in I(N, v)$, $\mathcal{D}(\alpha, x, v) \neq \emptyset$, and $\mathcal{E}(\alpha, x, v)$ such that $\mathcal{D}(\alpha, x, v) \cup \mathcal{E}(\alpha, x, v)$ is balanced. Consider a side-payment y with $y(S) \geq 0$ for every $S \in \mathcal{D}(\alpha, x, v)$ and $y_i \geq 0$ for every i with $x_i = v(i)$ [hence in particular for every i with $\{i\} \in \mathcal{E}(\alpha, x, v)$]. The argument in the first part of the proof of Theorem 19.5 now applies to $\mathcal{D}(\alpha, x, v) \cup \mathcal{E}(\alpha, x, v)$, and Theorem 19.4′ implies $x = \nu(N, v)$.

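For the ‘only-if’ part, consider the program (19.4) in the second part of the proof of Theorem 19.5 but add the constraints $-y_i \leq 0$ for every $i \in N$ with $x_i = v(i)$. Theorem 19.4′ implies that the dual of this program is feasible, that is, there are $\lambda(S) \geq 0$, $S \in \mathcal{D}(\alpha, x, v)$, $\lambda(\{i\}) \geq 0$, i such that $x_i = v(i)$, and $\lambda(N) \in \mathbb{R}$ such that

$$-\sum_{i \in N:\, x_i = v(i)} \lambda(\{i\}) e^{\{i\}} - \sum_{S \in \mathcal{D}(\alpha, x, v)} \lambda(S) e^S + \lambda(N) e^N = \sum_{S \in \mathcal{D}(\alpha, x, v)} e^S .$$

Hence $\lambda(N) e^N = \sum_{S \in \mathcal{D}(\alpha, x, v)} (1 + \lambda(S)) e^S + \sum_{i \in N:\, x_i = v(i)} \lambda(\{i\}) e^{\{i\}}$. Let $\mathcal{E}(\alpha, x, v)$ consist of those one-person coalitions $\{i\}$ with $x_i = v(i)$ and $\lambda(\{i\}) > 0$, then $\mathcal{D}(\alpha, x, v) \cup \mathcal{E}(\alpha, x, v)$ is balanced.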

19.10  Proof  of  Theorem  19.5 

To  formulate  the  dual  program,  use  for  instance  the  formulation  in  Theorem  16.20. 
For  instance,  the  primal  (19.4)  can  be  converted  to  the  minimization  problem 
in  Theorem  16.20;  then  the  dual  corresponds  to  the  maximization  problem  in 
Theorem  16.20.  Feasibility  of  the  dual  follows  from  Problem  16.14. 

19.11  Nucleolus  of  a  Three-Person  Game  (1) 

The  nucleolus  is  (5,  4,  3). 

19.12  Nucleolus  of  a  Three-Person  Game  (2) 

The  (pre-)nucleolus  is  (5,  3, 2). 

19.14  Individual  Rationality  Restrictions  for  the  Nucleolus 

The nucleolus is (1, 0, 0). The pre-nucleolus is (5/3, −1/3, −1/3).

19.15  Example  19.7 

The set $\mathcal{B}_1 = \{123, 124, 34\}$ is balanced with weights all equal to 1/2. The set $\mathcal{B}_1 \cup \mathcal{B}_2 = \{123, 124, 34, 134, 234\}$ is balanced with weights, respectively, equal to 5/12, 5/12, 3/12, 2/12, 2/12.
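Balancedness of these two collections is easy to verify mechanically: the weights must be positive and, for each player, the weights of the coalitions containing that player must add up to 1. The sketch below uses hypothetical helper names `is_balanced` and `parse`, with coalitions written as digit strings as in the text.

```python
from fractions import Fraction


def is_balanced(coalitions, weights, players):
    """True if the weights are positive and sum to 1 over the coalitions of each player."""
    if any(w <= 0 for w in weights):
        return False
    return all(
        sum(w for S, w in zip(coalitions, weights) if i in S) == 1
        for i in players
    )


def parse(name):
    """'124' -> {1, 2, 4}; works for single-digit player labels only."""
    return {int(ch) for ch in name}


players = [1, 2, 3, 4]
B1 = [parse(s) for s in ["123", "124", "34"]]
B12 = [parse(s) for s in ["123", "124", "34", "134", "234"]]
w1 = [Fraction(1, 2)] * 3
w12 = [Fraction(k, 12) for k in (5, 5, 3, 2, 2)]

print(is_balanced(B1, w1, players))
print(is_balanced(B12, w12, players))
```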

19.16  (Pre-)Nucleolus  of  a  Symmetric  Game 
(a) $\nu(v) = \nu^*(v) = (v(N)/n) e^N$.

19.17  COV  and  AN  of  the  Pre-nucleolus 
Covariance of the pre-nucleolus follows since applying a transformation as in the definition of this property changes all excesses (only) by the same positive (multiplicative) factor.

Anonymity of the pre-nucleolus follows since a permutation of the players does not change the ordered vectors $\theta(x)$, but only permutes the coalitions to which the excesses correspond.

19.18  Apex  Game 

The  (pre-)nucleolus  is  (3/7,  1/7,  1/7,  1/7,  1/7).  This  can  easily  be  verified  using 
the  Kohlberg  criterion. 

19.19  Landlord  Game 

(a) By anonymity, each worker is assigned $\frac{1}{2}[f(n) - f(n-1)]$. By computing the excesses, it follows that among all coalitions containing the landlord, with this payoff vector the maximal excesses are reached by the coalitions containing $n - 1$ workers, and further also by the coalitions consisting of a single worker and not the landlord. By the Kohlberg criterion this immediately implies that the given vector is the (pre-)nucleolus. For the Shapley value, see Problem 18.4.

(b) Compute the excesses for the payoff vector and apply the Kohlberg criterion.

19.20  Game  in  Sect.  19.1 

The first linear program is: minimize $\alpha$ subject to the constraints $x_i + \alpha \geq 4$ for $i = 1, 2, 3$, $x_1 + x_2 + \alpha \geq 8$, $x_1 + x_3 + \alpha \geq 12$, $x_2 + x_3 + \alpha \geq 16$, $x_1 + x_2 + x_3 = 24$. The program has optimal value $\alpha = -2$, reached for $x_1 = 6$ and $x_2, x_3 \geq 6$.

In the second program $x_1$ has been eliminated. This program reduces to: minimize $\alpha$ subject to $x_2 + \alpha \geq 4$, $x_2 \leq 12 + \alpha$, $x_2 + x_3 = 18$. This has optimal value $\alpha = -4$, reached for $x_2 = 8$, $x_3 = 10$.
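The two programs lead to the payoff vector (6, 8, 10), and its excesses can be tabulated directly. The sketch below reads the coalition worths off the constraints above (v(i) = 4, v(12) = 8, v(13) = 12, v(23) = 16, v(N) = 24) and uses the convention e(S, x) = v(S) − x(S); the variable names are illustrative.

```python
players = (1, 2, 3)
# worths of the proper coalitions, read off the constraints of the first program
v = {
    frozenset({1}): 4, frozenset({2}): 4, frozenset({3}): 4,
    frozenset({1, 2}): 8, frozenset({1, 3}): 12, frozenset({2, 3}): 16,
}


def excesses(x):
    """e(S, x) = v(S) - x(S) for every proper nonempty coalition S."""
    return {S: worth - sum(x[i - 1] for i in S) for S, worth in v.items()}


x = (6, 8, 10)
e = excesses(x)

# group the coalitions by excess level, highest level first
levels = {}
for S, value in e.items():
    levels.setdefault(value, []).append(S)

for value in sorted(levels, reverse=True):
    group = levels[value]
    is_partition = (sum(len(S) for S in group) == len(players)
                    and frozenset().union(*group) == frozenset(players))
    print(value, [sorted(S) for S in group], is_partition)
```

Each excess level (−2, −4, −6) is attained by a coalition and its complement. Such a pair is a partition of N and hence balanced, so the collections of coalitions with excess at least a given level are all balanced, in line with the Kohlberg criterion.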

19.21  The  Prekernel 

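For $i, j \in N$ denote by $\mathcal{T}_{ij}$ those coalitions that contain player i and not player j. For a payoff vector x denote by $s_{ij}(x, v)$ the maximum of $e(S, x, v)$ over all $S \in \mathcal{T}_{ij}$.

Let now x be the pre-nucleolus and suppose, contrary to what has to be proved, that there are two distinct players $k, \ell$ such that $s_{k\ell}(x, v) > s_{\ell k}(x, v)$. Let $\delta = (s_{k\ell}(x, v) - s_{\ell k}(x, v))/2$ and define y by $y_k = x_k + \delta$, $y_\ell = x_\ell - \delta$, and $y_i = x_i$ for all $i \neq k, \ell$. Denote $\mathcal{S} = \{S \in 2^N \setminus \mathcal{T}_{k\ell} \mid e(S, x, v) \geq s_{k\ell}(x, v)\}$ and $s = |\mathcal{S}|$. Then $\theta_{s+1}(x) = s_{k\ell}(x, v)$. For $S \in 2^N \setminus (\mathcal{T}_{k\ell} \cup \mathcal{T}_{\ell k})$, we have $e(S, x, v) = e(S, y, v)$. For $S \in \mathcal{T}_{k\ell}$ we have $e(S, y, v) = e(S, x, v) - \delta$. Finally, for $S \in \mathcal{T}_{\ell k}$ we have

$$e(S, y, v) = e(S, x, v) + \delta \leq s_{\ell k}(x, v) + \delta = s_{k\ell}(x, v) - \delta .$$

Thus, $\theta_t(y) = \theta_t(x)$ for all $t \leq s$ and $\theta_{s+1}(y) < s_{k\ell}(x, v) = \theta_{s+1}(x)$. Hence $\theta(x) \succ_{\text{lex}} \theta(y)$, a contradiction.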
Problems of Chapter 20

20.2  Example  20.3 

Argue  that  an  =  an  =  3  if  v  were  an  assignment  game.  Use  this  to  derive  a 
contradiction. 

20.3  Subgames  of  Permutation  Games 

That a subgame of a permutation game is again a permutation game follows immediately from the definition: in (20.3) the worth v(S) depends only on the numbers $k_{ij}$ for $i, j \in S$. By a similar argument [consider (20.1)] this also holds for assignment games.

20.4  A  Flow  Game 

(c) (1, 1, 0, 0), corresponding to the minimum cut through $e_1$ and $e_2$; $\{(0, 0, 1 + \alpha, 1 - \alpha) \mid 0 \leq \alpha \leq 1\}$, corresponding to the minimum cut through $e_3$ and $e_4$.

20.5  Every  Nonnegative  Balanced  Game  is  a  Flow  Game 

Let v be a nonnegative balanced game, and write (following the hint to the problem) $v = \sum_{r=1}^{k} \alpha_r w_r$, where $\alpha_r > 0$ and $w_r$ a balanced simple game for each $r = 1, \ldots, k$. Consider the controlled capacitated network with two vertices, the source and the sink, and k edges connecting them, where each edge $e_r$ has capacity $\alpha_r$ and is controlled by $w_r$. Then show that the associated flow game is v.

20.6  On  Theorem  20.6  (1) 

(a)  This  follows  straightforwardly  from  the  proof  of  Theorem  20.6. 

(b)  For  example,  each  player  receiving  5  ^  is  a  core  element. 

20.7  On  Theorem  20.6  (2) 

In any core element, player 1 should receive at least 1 and player 2 also, but v(N) = 1. Hence the game has an empty core.

20.8  Totally  Balanced  Flow  Games 

This  follows  immediately  from  Theorem  20.6,  since  every  dictatorial  game  is 
balanced,  i.e.,  has  veto  players. 

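20.9 If-Part of Theorem 20.9

We show that the Banzhaf value satisfies 2-EFF (the other properties are obvious). With notations as in the formulation of 2-EFF, we have

$$\psi_p(v_p) = \sum_{S \subseteq (N \setminus \{i,j\}) \cup \{p\}:\, p \notin S} \frac{1}{2^{|N|-2}} \left[ v_p(S \cup \{p\}) - v_p(S) \right]$$

$$= \sum_{S \subseteq N \setminus \{i,j\}} \frac{1}{2^{|N|-2}} \left[ v(S \cup \{i,j\}) - v(S) \right]$$

$$= \sum_{S \subseteq N \setminus \{i,j\}} \frac{1}{2^{|N|-1}} \left[ 2v(S \cup \{i,j\}) - 2v(S) \right] .$$

The term in brackets can be written as

$$\left[ v(S \cup \{i,j\}) - v(S \cup \{j\}) + v(S \cup \{i\}) - v(S) \right] + \left[ v(S \cup \{i,j\}) - v(S \cup \{i\}) + v(S \cup \{j\}) - v(S) \right] ,$$

hence $\psi_p(v_p) = \psi_i(v) + \psi_j(v)$.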

Show  that  DUM  cannot  be  weakened  to  NP  by  finding  a  different  value  satisfying 
2-EFF,  SYM,  NP,  and  SMON. 
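The 2-EFF identity for the Banzhaf value can be tested on a randomly generated game. The sketch below is an illustration under stated assumptions: the Banzhaf value is taken with the normalization $1/2^{|N|-1}$, and the merged game $v_p$ on $(N \setminus \{i,j\}) \cup \{p\}$ is defined by letting p stand for the pair {i, j}. The names `subsets`, `banzhaf` and `merge` are hypothetical.

```python
import random
from fractions import Fraction
from itertools import combinations


def subsets(players):
    """All subsets of `players`, as frozensets."""
    players = list(players)
    for r in range(len(players) + 1):
        for c in combinations(players, r):
            yield frozenset(c)


def banzhaf(players, v, i):
    """Banzhaf value of player i: average marginal contribution over coalitions without i."""
    others = [k for k in players if k != i]
    total = sum(v(S | {i}) - v(S) for S in subsets(others))
    return Fraction(total, 2 ** len(others))


def merge(v, i, j, p):
    """Game in which players i and j are replaced by the single player p."""
    def vp(S):
        return v((S - {p}) | {i, j}) if p in S else v(S)
    return vp


random.seed(0)
N = [1, 2, 3, 4]
worth = {S: random.randint(0, 10) for S in subsets(N)}
worth[frozenset()] = 0
v = worth.__getitem__

vp = merge(v, 1, 2, "p")
lhs = banzhaf([3, 4, "p"], vp, "p")
rhs = banzhaf(N, v, 1) + banzhaf(N, v, 2)
print(lhs, rhs)
```

The two printed values coincide for any choice of worths, which is what the displayed computation shows term by term.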


Problems  of  Chapter  21 


21.1  Anonymity  and  Symmetry 

An  example  of  a  symmetric  but  not  anonymous  solution  is  as  follows.  To  symmetric 
problems,  assign  the  point  in  W(S)  with  equal  coordinates;  otherwise,  assign  the 
point  of  S  that  is  lexicographically  (first  player  1,  then  player  2)  maximal. 


21.3  The  Nash  Solution  is  Well-Defined 

The function $x \mapsto (x_1 - d_1)(x_2 - d_2)$ is continuous on the compact set $\{x \in S \mid x \geq d\}$ and hence attains a maximum on this set. We have to show that this maximum is attained at a unique point. In general, consider two points $z, z' \in \{x \in S \mid x \geq d\}$, $z \neq z'$, with $(z_1 - d_1)(z_2 - d_2) = (z'_1 - d_1)(z'_2 - d_2) = \alpha$. Then one can show that at the point $w = \frac{1}{2}(z + z') \in S$ one has $(w_1 - d_1)(w_2 - d_2) > \alpha$. This implies that the maximum is attained at a unique point.
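One way to carry out the step "one can show" is sketched below. It is not taken from the text; it assumes $\alpha > 0$ (which holds at the maximum when S contains a point strictly above d) and writes $a = z - d$ and $a' = z' - d$, both nonnegative vectors with $a_1 a_2 = a'_1 a'_2 = \alpha$.

```latex
\begin{align*}
(w_1 - d_1)(w_2 - d_2)
  &= \tfrac{1}{4}(a_1 + a'_1)(a_2 + a'_2) \\
  &= \tfrac{1}{4}\left(2\alpha + a_1 a'_2 + a'_1 a_2\right) \\
  &\geq \tfrac{1}{4}\left(2\alpha + 2\sqrt{a_1 a_2\, a'_1 a'_2}\right)
   = \alpha .
\end{align*}
% Equality in the arithmetic-geometric mean step requires a_1 a'_2 = a'_1 a_2.
% Together with a_1 a_2 = a'_1 a'_2 > 0 this gives a_1 = a'_1 and a_2 = a'_2,
% i.e. z = z'. Hence the inequality is strict whenever z \neq z'.
```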


21.4 (a) $\Rightarrow$ (b) in Theorem 21.1

WPO and IIA are straightforward by definition, and SC follows from an easy computation. For SYM, note that if $N(S, d) = z$ for a symmetric problem $(S, d)$, then also $(z_2, z_1) = N(S, d)$ by definition of the Nash bargaining solution. Hence, $z_1 = z_2$ by uniqueness.


21.5  Geometric  Characterization  of  the  Nash  Bargaining  Solution 
Let $(S, d) \in \mathcal{B}$ and $N(S, d) = z$. The slope of the tangent line $\ell$ to the graph of the function $x_1 \mapsto (z_1 - d_1)(z_2 - d_2)/(x_1 - d_1) + d_2$ (which describes the level set of $x \mapsto (x_1 - d_1)(x_2 - d_2)$ through z) at z is equal to $-(z_2 - d_2)/(z_1 - d_1)$, i.e., the negative of the slope of the straight line through d and z. Clearly, $\ell$ supports S at z: this can be seen by invoking a separating hyperplane theorem, but also as follows. Suppose there were some point $z'$ of S on the other side of $\ell$ than d. Then there is a point w on the line segment connecting $z'$ and z (hence, $w \in S$) with $(w_1 - d_1)(w_2 - d_2) > (z_1 - d_1)(z_2 - d_2)$, contradicting $z = N(S, d)$. The existence of such a point w follows since otherwise the straight line through $z'$ and z would also be a tangent line to the level curve of the Nash product at z.

For the converse, suppose that at a point z there is a supporting line of S with slope $-(z_2 - d_2)/(z_1 - d_1)$. Clearly, this line is tangent to the graph of the function $x_1 \mapsto (z_1 - d_1)(z_2 - d_2)/(x_1 - d_1) + d_2$ at z. It follows that $z = N(S, d)$.

21.6  Strong  Individual  Rationality 

The implication (a) $\Rightarrow$ (b) is straightforward. For (b) $\Rightarrow$ (a), if F is also weakly Pareto optimal, then $F = N$ by Theorem 21.1. So it is sufficient to show that, if F is not weakly Pareto optimal then $F = D$. Suppose that F is not weakly Pareto optimal. Then there is an $(S, d) \in \mathcal{B}$ with $F(S, d) \notin W(S)$. By IR, $F(S, d) \geq d$. Suppose $F(S, d) \neq d$. By SC, we may assume w.l.o.g. $d = (0, 0)$. Let $\alpha > 0$ be such that $F(S, (0, 0)) \in W((\alpha, \alpha)S)$. Since $F(S, (0, 0)) \notin W(S)$, $\alpha < 1$. So $(\alpha, \alpha)S \subseteq S$. By IIA, $F((\alpha, \alpha)S, (0, 0)) = F(S, (0, 0))$, so by SC, $F((\alpha, \alpha)S, (0, 0)) = (\alpha, \alpha)F(S, (0, 0)) = F(S, (0, 0))$, contradicting $\alpha < 1$. So $F(S, (0, 0)) = (0, 0)$. Suppose $F(T, (0, 0)) \neq (0, 0)$ for some $(T, (0, 0)) \in \mathcal{B}$. By SC we may assume $(0, 0) \neq F(T, (0, 0)) \in S$. By IIA applied twice, $(0, 0) = F(S \cap T, (0, 0)) = F(T, (0, 0)) \neq (0, 0)$, a contradiction. Hence, $F = D$.

21.7 (a) $\Rightarrow$ (b) in Theorem 21.2

Straightforward.  Note  in  particular  that  in  a  symmetric  game  the  utopia  point  is  also 
symmetric,  and  that  the  utopia  point  is  ‘scale  covariant’. 

21.8  Restricted  Monotonicity 

(a)  Follows  from  applying  IM  twice. 

(b) For $(S, d)$ with $d = (0, 0)$ and $u(S, d) = (1, 1)$, let $F(S, d)$ be the lexicographically (first player 1, then player 2) maximal point of $S \cap \mathbb{R}^2_+$. Otherwise, let F be equal to R. This F satisfies RM but not IM.

21.9  Global  Individual  Monotonicity 

It is straightforward to verify that G satisfies WPO, SYM, SC, and GIM. For the converse, suppose that F satisfies these four axioms, let $(S, d) \in \mathcal{B}$ and $z := G(S, d)$. By SC, w.l.o.g. $d = (0, 0)$ and $g(S) = (1, 1)$. Let $\alpha \leq 0$ such that $S \subseteq \tilde{S}$ where $\tilde{S} := \{x \in \mathbb{R}^2 \mid (\alpha, \alpha) \leq x \leq y \text{ for some } y \in S\}$. In order to prove $F(S, (0, 0)) = G(S, (0, 0))$ it is sufficient to prove that $F(\tilde{S}, (0, 0)) = G(\tilde{S}, (0, 0))$ (in view of GIM and WPO). Let $T = \text{conv}\{z, (\alpha, g_2(S)), (g_1(S), \alpha)\} = \text{conv}\{z, (\alpha, 1), (1, \alpha)\}$. By SYM and WPO, $F(T, (0, 0)) = z$. By GIM, $F(\tilde{S}, (0, 0)) \geq F(T, (0, 0)) = z = G(S, (0, 0)) = G(\tilde{S}, (0, 0))$, so by WPO: $F(\tilde{S}, (0, 0)) = G(\tilde{S}, (0, 0))$. (Make pictures. Note that this proof is analogous to the proof of Theorem 21.2.)

21.10  Monotonicity  and  (Weak)  Pareto  Optimality 

(a) Consider problems of the kind $(\text{conv}\{d, a\}, d)$ for some $a > d$.

(b) The egalitarian solution E satisfies MON and WPO on $\mathcal{B}_0$.

21.11  The  Egalitarian  Solution  (1) 

Straightforward. 
21.12 The Egalitarian Solution (2)

Let $z := E(S, d) + E(T, e)$. Then it is straightforward to derive that $z_1 - (d_1 + e_1) = z_2 - (d_2 + e_2)$. Since $E(S + T, d + e)$ is the maximal point x such that $x_1 - (d_1 + e_1) = x_2 - (d_2 + e_2)$, it follows that $E(S + T, d + e) \geq z$.

21.13  Independence  of  Axioms 
Theorem  21.1: 

WPO, SYM, SC: $F = R$; WPO, SYM, IIA: $F = L$, where $L(S, d)$ is the point of $P(S)$ nearest to the point $z \geq d$ with $z_1 - d_1 = z_2 - d_2$ measured along the boundary of S; WPO, SC, IIA: $F = D^1$, where $D^1(S, d)$ is the point of $\{x \in P(S) \mid x \geq d\}$ with maximal first coordinate; SYM, SC, IIA: $F = D$ (disagreement solution).

Theorem 21.2:

WPO, SYM, SC: $F = N$; WPO, SYM, IM: $F = L$; WPO, SC, IM: if $d = (0, 0)$ and $u(S, d) = (1, 1)$, let F assign the point of intersection of $W(S)$ and the line segment connecting (1/4, 3/4) and (1, 1) and, otherwise, let F be determined by SC; SYM, SC, IM: $F = D$.

Theorem  21.3: 

WPO,  MON,  SYM:  F(S,  d)  is  the  maximal  point  of  S  on  the  straight  line  through 
d  with  slope  1/3  if  d  =  (1,0),  F(S,  d)  =  E(S,  d)  otherwise;  WPO,  MON,  TC: 
F(S,  d)  is  the  maximal  point  of  S  on  the  straight  line  through  d  with  slope  1/3; 
WPO,  SYM,  TC:  F  =  N;  MON,  SYM,  TC:  F  =  D. 

21.14  Nash  and  Rubinstein 

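(b) The Nash bargaining solution outcome is $(\frac{1}{3}\sqrt{3}, \frac{2}{3})$, hence $(\frac{1}{3}\sqrt{3}, 1 - \frac{1}{3}\sqrt{3})$ is the resulting distribution of the good.

(c) The Rubinstein bargaining outcome is $\left( \sqrt{\frac{1-\delta}{1-\delta^3}}, \frac{\delta-\delta^3}{1-\delta^3} \right)$.

(d) The outcome in (c) converges to the outcome in (b) if $\delta$ converges to 1.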


Problems  of  Chapter  22 

22.1  Convex  Sets 

The only-if part is obvious. For the if-part, for any two vectors x and y in Z the condition implies that $\frac{k}{2^m} x + \frac{2^m - k}{2^m} y \in Z$ for every $m \in \mathbb{N}$ and $k \in \{0, 1, \ldots, 2^m\}$. By closedness of Z, this implies that $\text{conv}\{x, y\} \subseteq Z$, hence Z is convex.

22.2 Proof of Lemma 22.3

Suppose that both systems have a solution, say $(y, z) \geq 0$, $(y, z) \neq 0$, $Ay + z = 0$, $x > 0$, $xA > 0$. Then $xAy + x \cdot z = x(Ay + z) = 0$, hence $y = 0$ and $z = 0$ since $x > 0$, $xA > 0$. This contradicts $(y, z) \neq 0$.

22.3 Rank of $AA^T$

This follows from basic linear algebra. Prove that the null spaces of A and $A^T A$ are equal and use the Rank Theorem to conclude that the rank of $A^T A$, and thus that of $AA^T$, is equal to the rank of A.
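The statement can be illustrated with exact arithmetic on a small example. The sketch below is not a proof: the matrix A is an arbitrary 4 × 3 example of rank 2, and the helpers `rank`, `transpose` and `matmul` are hypothetical names for textbook routines.

```python
from fractions import Fraction


def rank(M):
    """Rank by Gaussian elimination over the rationals."""
    rows = [[Fraction(x) for x in row] for row in M]
    ncols = len(rows[0]) if rows else 0
    r = 0  # number of pivot rows found so far
    for c in range(ncols):
        pivot = next((k for k in range(r, len(rows)) if rows[k][c] != 0), None)
        if pivot is None:
            continue
        rows[r], rows[pivot] = rows[pivot], rows[r]
        for k in range(r + 1, len(rows)):
            factor = rows[k][c] / rows[r][c]
            rows[k] = [a - factor * b for a, b in zip(rows[k], rows[r])]
        r += 1
    return r


def transpose(M):
    return [list(col) for col in zip(*M)]


def matmul(A, B):
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)] for row in A]


# first row equals third row plus twice the fourth row, so the rank is 2
A = [[1, 2, 3], [2, 4, 6], [1, 0, 1], [0, 1, 1]]
At = transpose(A)
print(rank(A), rank(matmul(At, A)), rank(matmul(A, At)))
```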




22.4  Proof  of  Lemma  22.5 

Suppose that both systems have a solution, say $x \geq 0$, $xA = b$, $Ay \geq 0$, $b \cdot y < 0$. Then $xAy = b \cdot y < 0$, contradicting $x \geq 0$ and $Ay \geq 0$.

22.5  Proof  of  Lemma  22.7 

(a) If $x \geq 0$, $xA \leq b$, $y \geq 0$ and $b \cdot y < 0$ then $xAy \leq b \cdot y < 0$, so $Ay \not\geq 0$. This shows that at most one of the two systems has a solution.

(b) Suppose the system in (a) has no solution. Then also the system $xA + zI = b$,


b  •  y  <  0  has  a  solution.  Therefore,  the  system  in  (b)  has  a  solution. 


22.6  Extreme  Points 

The implication (b) $\Rightarrow$ (a) follows by definition of an extreme point.

For the implication (a) $\Rightarrow$ (c), let $x, y \in C \setminus \{e\}$ and $0 < \lambda < 1$. Let $z = \lambda x + (1 - \lambda)y$. If $z \neq e$ then $z \in C \setminus \{e\}$ by convexity of C. Suppose now that $z = e$. W.l.o.g. let $\lambda \geq 1/2$. Then $e = \lambda x + (1 - \lambda)y = (1/2)x + (1/2)[\mu x + (1 - \mu)y]$ for $\mu = 2\lambda - 1$. Since $\mu x + (1 - \mu)y \in C$, this implies that e is not an extreme point of C. This proves the implication (a) $\Rightarrow$ (c).

For the implication (c) $\Rightarrow$ (b), let $x, y \in C$, $x \neq y$, and $0 < \alpha < 1$. If $x = e$ or $y = e$ then clearly $\alpha x + (1 - \alpha)y \neq e$. If $x \neq e$ and $y \neq e$ then $\alpha x + (1 - \alpha)y \in C \setminus \{e\}$ by convexity of $C \setminus \{e\}$, hence $\alpha x + (1 - \alpha)y \neq e$ as well.

22.7  Affine  Subspaces 

Let $A = a + L$ be an affine subspace, $x, y \in A$, and $\lambda \in \mathbb{R}$. Write $x = a + \bar{x}$ and $y = a + \bar{y}$ for $\bar{x}, \bar{y} \in L$, then $\lambda x + (1 - \lambda)y = a + \lambda \bar{x} + (1 - \lambda)\bar{y} \in A$ since $\lambda \bar{x} + (1 - \lambda)\bar{y} \in L$ (L is a linear subspace).

Conversely, suppose that A contains the straight line through any two of its elements. Let a be an arbitrary element of A and let $L := \{x - a \mid x \in A\}$. Then it follows straightforwardly that L is a linear subspace of V, and thus $A = a + L$ is an affine subspace.

22.8  The  Set  of  Sup-points  of  a  Linear  Function  on  a  Convex  Set 

In general, linearity of f implies that, if $f(x) = f(y)$, then $f(\lambda x + (1 - \lambda)y) = f(x) = f(y)$ for any two points of C and $0 < \lambda < 1$. It follows, in particular, that the set D is convex.

Let $e \in \text{ext}(D)$ and suppose $e = (1/2)x + (1/2)y$ for some $x, y \in C$. Then by linearity of f, $f(e) = (1/2)f(x) + (1/2)f(y)$, hence $x, y \in D$ since $e \in D$. So $e = x = y$ since e is an extreme point of D. Thus, e is also an extreme point of C.
Problems of Chapter 23

RP  1  Matrix  Games  (1) 

(a)  All  rows  are  (pure)  maximin  strategies  (with  minimum  0)  and  all  columns  are 
pure  minimax  strategies  (with  maximum  2).  The  value  of  the  game  is  between 
0  and  2  (which  is  obvious  anyway  in  this  case). 

(b) The third column is strictly dominated by the second column and the third row is strictly dominated by the second row. Entry (1, 2) is a saddlepoint, hence the value of the game is 2. The unique maximin strategy is (1, 0, 0), and the minimax strategies are the strategies in the set $\{(q, 1 - q, 0) \mid 0 \leq q \leq 1/2\}$.

(c)  The  second  and  third  rows  are  the  maximin  rows.  The  second  column  is  the 
unique  minimax  column.  From  this  we  can  conclude  that  the  value  of  the 
game  is  between  1  and  2.  The  first  and  fourth  columns  are  strictly  dominated 
by  the  second.  Next,  the  first  row  is  strictly  dominated  by  the  last  row.  The 
unique  maximin  strategy  is  (0,2/3, 1/3)  and  the  unique  minimax  strategy  is 
(0, 2/3, 1/3, 0).  The  value  of  the  game  is  5/3. 

RP  2  Matrix  Games  (2) 

(a)  The  first  row  is  the  unique  maximin  row  (with  minimum  2)  and  both  columns 
are  minimax  columns  (with  maximum  5).  So  the  value  is  between  2  and  5.  The 
game  has  no  saddlepoint. 

(b) $v(A_1) = 5/2$, $v(A_2) = 20/7$, $v(A_3) = 2$ (saddlepoint), $v(A_4) = 1$ (saddlepoint), $v(A_5) = 7/3$, $v(A_6) = 25/9$. Since player 1 can pick rows, the value must be the maximum of these amounts, hence 20/7, the value of $A_2$.

(c)  The  unique  maximin  strategy  is  (5/7, 0,  2/7,  0)  and  the  unique  minimax  strategy 
is  (3/7, 4/7). 

RP  3  Matrix  Games  (3) 

(a)  The  unique  maximin  row  is  the  first  row,  with  minimum  8.  The  unique  minimax 
column  is  the  first  column,  with  maximum  12.  So  the  value  of  the  game  is 
between  8  and  12.  The  game  has  no  saddlepoint. 

(b) The second row is strictly dominated by for instance putting probability 1/2 on the first row and 1/2 on the third row. After eliminating the second row, the third column is strictly dominated by the first column.

(c)  The  unique  maximin  strategy  is  (1/2, 0,1/2)  and  the  unique  minimax  strategy 
is  (3/4, 1/4,  0).  The  value  of  the  game  is  10. 

RP  4  Bimatrix  Games  (1) 

(a) D is strictly dominated by 3/5 · U + 2/5 · M. Next, C is strictly dominated by R.

(b) In the reduced (two by two) game, the best reply function of player 1 is: play U if player 2 puts less than probability 2/5 on L, play M if player 2 puts more than probability 2/5 on L, and play any combination of U and M if player 2 puts probability 2/5 on L. The best reply function of player 2 is: play R if player 1 puts positive probability on U, and play any combination of L and R if player 1 plays M. The set of Nash equilibria is: $\{((1, 0), (0, 1))\} \cup \{((0, 1), (q, 1 - q)) \mid 1 \geq q \geq 2/5\}$.

(c) The set of Nash equilibria in the original game is: $\{((1, 0, 0), (0, 0, 1))\} \cup \{((0, 1, 0), (q, 0, 1 - q)) \mid 1 \geq q \geq 2/5\}$.

RP  5  Bimatrix  Games  (2) 

(a) For $x > 2$: $\{((1, 0), (1, 0))\}$. For $x = 2$: $\{((1, 0), (1, 0))\} \cup \{((p, 1 - p), (0, 1)) \mid 0 \leq p \leq 1/2\}$. For $0 < x < 2$: $\{((1/2, 1/2), ((2 - x)/2, x/2))\}$. For $x = 0$: $\{((0, 1), (0, 1))\} \cup \{((p, 1 - p), (1, 0)) \mid 1 \geq p \geq 1/2\}$. For $x < 0$: $\{((0, 1), (0, 1))\}$.

(b)  /  is  strictly  dominated  byl/3-^  +  2/3  -g.  Next:  b  is  strictly  dominated  by  c,  e  by 
g ,  a  by  d.  The  remaining  two  by  two  game  has  a  unique  Nash  equilibrium.  In  the 
original  game  the  unique  Nash  equilibrium  is  ((0,0,4/9,  5/9),  (0,0, 1/2, 1/2)). 

RP  6  Voting 
(a)

           (4,0)       (3,1)       (2,2)
(4,0)    3/2, 3/2      1, 2        1, 2
(3,1)      2, 1      3/2, 3/2      1, 2
(2,2)      2, 1        2, 1      3/2, 3/2


(b)  By  iterated  elimination  of  strictly  dominated  strategies  it  follows  that  the  unique 
Nash  equilibrium  in  this  game  is  ((2, 2),  (2,  2)).  (This  is  a  constant  sum  game: 
(2,  2)  is  the  optimal  strategy  for  each  party.) 
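The elimination argument in (b) can be replayed mechanically. The sketch below encodes the payoff table from (a) and repeatedly removes pure strategies that are strictly dominated by another pure strategy, which suffices in this game; the function name `iesds` is hypothetical.

```python
from fractions import Fraction

labels = ["(4,0)", "(3,1)", "(2,2)"]
h = Fraction(3, 2)
# payoff[r][c] = (payoff to party 1, payoff to party 2)
payoff = [
    [(h, h), (1, 2), (1, 2)],
    [(2, 1), (h, h), (1, 2)],
    [(2, 1), (2, 1), (h, h)],
]


def iesds(payoff):
    """Iterated elimination of pure strategies strictly dominated by a pure strategy."""
    rows = list(range(len(payoff)))
    cols = list(range(len(payoff[0])))
    changed = True
    while changed:
        changed = False
        for r in rows[:]:
            if any(all(payoff[r2][c][0] > payoff[r][c][0] for c in cols)
                   for r2 in rows if r2 != r):
                rows.remove(r)
                changed = True
        for c in cols[:]:
            if any(all(payoff[r][c2][1] > payoff[r][c][1] for r in rows)
                   for c2 in cols if c2 != c):
                cols.remove(c)
                changed = True
    return rows, cols


rows, cols = iesds(payoff)
print([labels[r] for r in rows], [labels[c] for c in cols])
```

Only (2,2) survives for each party. Every entry of the table sums to 3, which is the constant-sum property mentioned in (b).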


RP  7  A  Bimatrix  Game 


(a) For $a \neq 0$ the unique Nash equilibrium is ((1/2, 1/2), (1/2, 1/2)). For $a = 0$ the set of Nash equilibria is $\{((p, 1 - p), (0, 1)) \mid 1 \geq p \geq 1/2\} \cup \{((p, 1 - p), (1, 0)) \mid 0 \leq p \leq 1/2\} \cup \{((1/2, 1/2), (q, 1 - q)) \mid 0 \leq q \leq 1\}$.

(b) The strategic form of this game is

        LL      LR      RL      RR
T      a, 0    a, 0    0, 1    0, 1
B      0, 1    a, 0    0, 1    a, 0

There  are  two  subgame  perfect  equilibria  in  pure  strategies:  player  1  plays  T  and 
player  2  plays  RL  (i.e.,  R  after  T  and  L  after  B );  and  player  1  plays  B  and  player 
2  plays  RL. 
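The two subgame perfect equilibria can be recovered by backward induction. The sketch below rests on assumptions that are read off the strategic form rather than stated in this answer: player 1 moves first (T or B), player 2 observes the move and chooses L or R, and the payoffs are (a, 0) after (T, L) and (B, R) and (0, 1) after (T, R) and (B, L). The function name `subgame_perfect` is hypothetical.

```python
def subgame_perfect(a):
    """Pure subgame perfect equilibria as (move of player 1, plan of player 2)."""
    # assumed extensive form, inferred from the strategic form
    outcome = {
        ("T", "L"): (a, 0), ("T", "R"): (0, 1),
        ("B", "L"): (0, 1), ("B", "R"): (a, 0),
    }
    # step 1: player 2's best replies after each move of player 1
    reply = {}
    for move in "TB":
        best = max(outcome[(move, r)][1] for r in "LR")
        reply[move] = [r for r in "LR" if outcome[(move, r)][1] == best]
    # step 2: player 1's best moves against each optimal plan of player 2
    equilibria = []
    for after_t in reply["T"]:
        for after_b in reply["B"]:
            plan = {"T": after_t, "B": after_b}
            u1 = {move: outcome[(move, plan[move])][0] for move in "TB"}
            best = max(u1.values())
            for move in "TB":
                if u1[move] == best:
                    equilibria.append((move, after_t + after_b))
    return equilibria


print(subgame_perfect(1))
```

Player 2 strictly prefers R after T and L after B, so her plan is RL regardless of a; player 1 then obtains 0 from either move and is indifferent, which gives the two equilibria.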
RP 8 An Ice-Cream Vendor Game

(a)  There  are  four  different  situations:  (i)  all  vendors  in  the  same  location:  each  gets 
400;  (ii)  two  in  the  same  location  and  the  third  vendor  in  a  neighboring  location: 
the  first  two  get  300  and  the  third  gets  600;  (iii)  two  in  the  same  location  and  the 
third  vendor  in  the  opposite  location:  the  first  two  get  300  and  the  third  gets  600; 
and  (iv)  all  vendors  in  different  locations:  the  middle  one  gets  300  and  the  others 
get 450 each. From this it is easily seen that (iii) and (iv) are Nash equilibria but (i) and (ii) are not Nash equilibria.

(b)  There  are  many  subgame  perfect  Nash  equilibria,  but  they  can  be  reduced 
to  three  types:  (i)  player  1  chooses  arbitrarily,  player  2  chooses  the  opposite 
location  of  player  1 ,  and  player  3  chooses  a  remaining  optimal  open  location;  (ii) 
player  1  chooses  arbitrarily,  player  2  chooses  one  of  the  neighboring  locations 
of  player  1,  and  player  3  chooses  the  opposite  location  of  player  2  if  that  is 
unoccupied,  and  otherwise  the  same  location  as  player  2;  (iii)  player  1  chooses 
arbitrarily,  player  2  chooses  the  same  location  as  player  1 ,  and  player  3  chooses 
the  opposite  location  of  player  1 . 

RP  9  A  Repeated  Game 

(a)  (U,L,B)  and  (D,R,B). 

(b)  In  the  second  period,  after  each  action  combination  of  the  first  period,  one  of  the 
two  equilibria  in  (a)  has  to  be  played. 

(c)  In  the  first  period  player  1  plays  U ,  player  2  plays  R ,  and  player  3  plays  A.  In 
the  second  period,  if  the  first  period  resulted  in  (U,  R,A)  then  player  1  plays 
D,  player  2  plays  R,  and  player  3  plays  B;  in  all  other  cases,  player  1  plays  U, 
player  2  plays  L,  and  player  3  plays  B. 

(d) In the first period player 1 plays U, player 2 plays R, and player 3 plays B. In
the second period, if the first period resulted in (U, R, B) then player 1 plays U,
player 2 plays L, and player 3 plays B; in all other cases, player 1 plays D, player
2 plays R, and player 3 plays B.

RP  10  Locating  a  Pub 

(a)  Player  1  has  three  pure  strategies  and  player  2  has  eight  pure  strategies. 

(b)  Player  1  chooses  B.  Player  2  chooses  B ,  C,  B ,  if  player  1  chooses  A,  B ,  C 
respectively. 

(c)  Player  1  has  24  pure  strategies  and  player  2  has  8  pure  strategies. 

(d) (i) Player 1 plays A; after A the subgame equilibrium (B, C) is played, after B
the subgame equilibrium (A, C), and after C the subgame equilibrium (A, B).

(ii) Player 1 plays B; after A the subgame equilibrium (B, C) is played, after B
the subgame equilibrium (C, A), and after C the subgame equilibrium (A, B).

(iii) Player 1 plays C; after A the subgame equilibrium (B, C) is played, after B
the subgame equilibrium (C, A), and after C the subgame equilibrium (B, A).




RP  11 A  Two-Stage  Game 

(a) In G1: (D, R); in G2: (T, X), (M, Y), and (B, Z).

(b) Each player has 2 · 3⁴ = 162 pure strategies.

(c) In G1 player 1 plays U and player 2 plays L. In G2 the players play as follows.
If (U, L) was played, then player 1 plays M and player 2 plays Y. If (D, L) was
played, then player 1 plays B and player 2 plays Z. If (U, R) was played, then
player 1 plays T and player 2 plays X. If (D, R) was played, then player 1 plays
M and player 2 plays Y.

(d) In the second stage (in G1) always (U, L) has to be played. Hence, there are three
subgame perfect equilibria, corresponding to the three Nash equilibria of G2.

RP  12  Job  Market  Signaling 

(a) [Figure not reproduced; surviving payoff labels: (4, 4), (10, 0), (4, 4).]

(b)  The  Nash  equilibria  are:  (i)  type  H  plays  E ,  type  L  plays  N,  F  plays  M  after  E 
and  C  after  N;  (ii)  both  types  play  N,  F  always  plays  C. 

(c) The equilibrium in (i) is separating with (forced) beliefs α = 1 and β = 0. The
equilibrium in (ii) is pooling with β = 1/3 (forced) and α ≤ 2/5. According to
the intuitive criterion we must have α = 1, so that the intuitive criterion is not
satisfied by the latter equilibrium. (It does not apply to the first equilibrium.)

RP 13 Second-Hand Cars (1)

(a,  b)  The  extensive  form  of  this  signaling  game  is  as  follows: 


The strategic form is:

            bb         bn           nb            nn
    CC    10, 5      10, 5        0, 0          0, 0
    CN    15, 0      7.5, 2.5     7.5, −2.5     0, 0
    NC    10, 5      2.5, 2.5     7.5, 2.5      0, 0
    NN    15, 0      0, 0         15, 0         0, 0

The  Nash  equilibria  are:  (CC,  bn),  (NN,  bb),  (NN,  nb),  (NN,  nn). 

(c) (CC, bn) is pooling with β ≤ 1/2, (NN, bb) is pooling for all α. The other
two equilibria are not perfect Bayesian, since player 2 will play b after C.
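
The four equilibria listed for RP 13 can be checked mechanically. The following is a minimal Python sketch, assuming the payoff entries of the strategic form are read row by row (rows CC, CN, NC, NN for player 1, columns bb, bn, nb, nn for player 2). The helper name pure_nash is hypothetical and not taken from the text.

```python
rows = ["CC", "CN", "NC", "NN"]
cols = ["bb", "bn", "nb", "nn"]

# payoff[r][c] = (payoff to player 1, payoff to player 2), as read from the table
payoff = [
    [(10, 5), (10, 5), (0, 0), (0, 0)],
    [(15, 0), (7.5, 2.5), (7.5, -2.5), (0, 0)],
    [(10, 5), (2.5, 2.5), (7.5, 2.5), (0, 0)],
    [(15, 0), (0, 0), (15, 0), (0, 0)],
]


def pure_nash(payoff):
    """Return all cells in which neither player has a profitable pure deviation."""
    n_rows, n_cols = len(payoff), len(payoff[0])
    found = []
    for r in range(n_rows):
        for c in range(n_cols):
            u1, u2 = payoff[r][c]
            row_is_best = all(payoff[k][c][0] <= u1 for k in range(n_rows))
            col_is_best = all(payoff[r][k][1] <= u2 for k in range(n_cols))
            if row_is_best and col_is_best:
                found.append((r, c))
    return found


equilibria = [(rows[r], cols[c]) for r, c in pure_nash(payoff)]
print(equilibria)
```

Under this reading of the table, the enumeration returns exactly (CC, bn), (NN, bb), (NN, nb), and (NN, nn).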

RP  14  Second-Hand  Cars  (2) 

(a) This is a static game of incomplete information, represented by the pair G1, G2:

    G1:        1          3          5
      1      1, −1      0, 0       0, 0
      3      0, 0       −1, 1      0, 0
      5      −1, 1      −2, 2      −3, 3

    G2:        1          3          5
      1      3, −3      0, 0       0, 0
      3      2, −2      1, −1      0, 0
      5      1, −1      0, 0       −1, 1

where G1 is played with probability 25% and G2 with probability 75%. (The
numbers should be multiplied by 1,000, the buyer is the row and the seller the
column player.)

(b)  The  buyer  has  one  type  and  three  pure  strategies,  the  seller  has  two  types  and 
nine  pure  strategies. 

(c)  Strategy  “5”  is  strictly  dominated  by  strategy  “3”. 

(d)  Against  strategy  “3”  of  the  buyer  the  best  reply  of  the  seller  is  the  combination 
(3,5),  but  against  this  combination  the  best  reply  of  the  buyer  is  “1”. 

(e) Against strategy “1” of the buyer the seller has four best replies: (3, 3), (3, 5),
(5, 3), and (5, 5). In turn, (only) against (3, 5) and (5, 5) is “1” a best reply.
Hence there are two Nash equilibria in pure strategies: (i) (1, (3, 5)) and (ii)
(1, (5, 5)). No trade is going to take place.
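
The best-reply reasoning in (d) and (e) can be reproduced by brute force. The sketch below is a minimal Python illustration, assuming the two payoff matrices are read as (buyer, seller) pairs with the buyer choosing the row, and that G1 and G2 occur with probabilities 1/4 and 3/4. All function names are hypothetical.

```python
from fractions import Fraction
from itertools import product

bids = [1, 3, 5]

# game[buyer_bid][seller_bid] = (buyer payoff, seller payoff), as read from the text
G1 = {1: {1: (1, -1), 3: (0, 0), 5: (0, 0)},
      3: {1: (0, 0), 3: (-1, 1), 5: (0, 0)},
      5: {1: (-1, 1), 3: (-2, 2), 5: (-3, 3)}}
G2 = {1: {1: (3, -3), 3: (0, 0), 5: (0, 0)},
      3: {1: (2, -2), 3: (1, -1), 5: (0, 0)},
      5: {1: (1, -1), 3: (0, 0), 5: (-1, 1)}}
P1, P2 = Fraction(1, 4), Fraction(3, 4)


def buyer_payoff(b, s1, s2):
    """Expected buyer payoff when the seller plays s1 in G1 and s2 in G2."""
    return P1 * G1[b][s1][0] + P2 * G2[b][s2][0]


def seller_best(game, b):
    """Best replies of one seller type against the buyer's strategy b."""
    top = max(game[b][s][1] for s in bids)
    return {s for s in bids if game[b][s][1] == top}


def is_equilibrium(b, s1, s2):
    buyer_ok = all(buyer_payoff(k, s1, s2) <= buyer_payoff(b, s1, s2) for k in bids)
    return buyer_ok and s1 in seller_best(G1, b) and s2 in seller_best(G2, b)


equilibria = [(b, (s1, s2)) for b, s1, s2 in product(bids, repeat=3)
              if is_equilibrium(b, s1, s2)]
print(equilibria)
```

With these inputs the search finds the two equilibria (1, (3, 5)) and (1, (5, 5)) named in (e).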

RP  15  Signaling  Games 

(a)  The  strategic  form  with  best  replies  underlined  is: 


            uu           ud           du           dd
    LL    2, 1         2, 1         1.5, 0.5     1.5, 0.5
    LR    2.5, 1.5     1.5, 1       2, 0.5       1, 0
    RL    1, 0         0.5, 0.5     1, 0.5       0.5, 1
    RR    1.5, 0.5     0, 0.5       1.5, 0.5     0, 0.5

(LR, uu) is a separating perfect Bayesian equilibrium with beliefs α = 1 and
β = 0. (LL, ud) is a pooling Bayesian equilibrium with beliefs α = 1/2 and
β ≥ 1/2. For the latter, the intuitive criterion requires β = 0, so that this
equilibrium does not satisfy it.

(b)  The  strategic  form  with  best  replies  underlined  is: 


            uu           ud           du           dd
    LL    3, 1.5       3, 1.5       0.5, 1       0.5, 1
    LR    2, 1         2.5, 0       1, 1.5       1.5, 0.5
    RL    1.5, 1.5     3.5, 2       0, 0.5       2, 1
    RR    0.5, 1       3, 0.5       0.5, 1       3, 0.5

(LL, uu) is a pooling perfect Bayesian equilibrium with beliefs α = 1/2 and
β ≤ 2/3. The intuitive criterion requires β = 1, so this pooling equilibrium
does not satisfy it. (LR, du) is a separating perfect Bayesian equilibrium with
beliefs α = 1 and β = 0, and (RL, ud) is a separating perfect Bayesian
equilibrium with beliefs α = 0 and β = 1.


RP  16  A  Game  of  Incomplete  Information 


(a)  Start  with  the  decision  node  of  player  1 .  Player  1  has  four  actions/strategies:  AA , 
AB ,  BA,  BB.  All  these  actions  lead  to  one  and  the  same  information  set  of  player 
2,  who  has  three  actions/strategies:  C,  D,  E. 

(b) The strategic form is:

            C           D            E
    AA    3, 2        1.5, 1.5     2.5, 1.5
    AB    4, 2.5      2.5, 3       1, 1.5
    BA    3, 2        1.5, 1.5     2.5, 1.5
    BB    4, 2.5      2.5, 3       1, 1.5

The  Nash  equilibria  in  pure  strategies  are  ( AB ,  D )  and  ( BB ,  D ). 

(c) Player 1 has now two pure strategies, namely A and B. If player 1 plays A then
the best reply of player 2 is EC. Against EC, the payoff of A is 1.5 and the payoff
of B is 2.5, so that A is not a best reply against EC. Against B, the best reply of
player 2 is ED. In turn, B is player 1’s best reply against ED (yields 2 whereas
A only yields 1). So the unique Nash equilibrium in pure strategies is (B, ED).

RP  17  A  Bayesian  Game 

(a) This is the game

            F         Y
    F     −1, 1     1, 0
    Y     0, 1      0, 0

with (Y, F) as unique Nash equilibrium (also in mixed strategies).

(b) Start with the decision node for player 1, who has two actions/strategies: F and
Y. Player 2 has a single information set and four actions/strategies: FF, FY, YF,
YY.

(c) The strategic form is:

            FF                  FY              YF            YY
    F     1 − 2a, 2a − 1      1 − 2a, a       1, a − 1      1, 0
    Y     0, 1                0, a            0, 1 − a      0, 0

For a = 0 the Nash equilibria in pure strategies are (F, FY) and (F, YY). For
0 < a < 1/2: (F, FY). For a = 1/2: (F, FY) and (Y, FF). For 1/2 < a < 1:
(Y, FF). For a = 1: (Y, FF) and (Y, FY).
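
The case distinction over a can be verified by enumerating pure equilibria for sample values of a. This is a minimal Python sketch, assuming the strategic form in (c) is read as rows F, Y and columns FF, FY, YF, YY; the function names are hypothetical.

```python
from fractions import Fraction

ROWS = ["F", "Y"]
COLS = ["FF", "FY", "YF", "YY"]


def game(a):
    """Strategic form from (c) for a given value of the parameter a."""
    return [
        [(1 - 2 * a, 2 * a - 1), (1 - 2 * a, a), (1, a - 1), (1, 0)],
        [(0, 1), (0, a), (0, 1 - a), (0, 0)],
    ]


def pure_nash(a):
    """List the pure Nash equilibria of the game for the given a."""
    g = game(a)
    found = []
    for r in range(2):
        for c in range(4):
            u1, u2 = g[r][c]
            row_is_best = all(g[k][c][0] <= u1 for k in range(2))
            col_is_best = all(g[r][k][1] <= u2 for k in range(4))
            if row_is_best and col_is_best:
                found.append((ROWS[r], COLS[c]))
    return found


for a in [Fraction(0), Fraction(1, 4), Fraction(1, 2), Fraction(3, 4), Fraction(1)]:
    print(a, pure_nash(a))
```

The sample values 0, 1/4, 1/2, 3/4, and 1 cover the five cases; interior values other than 1/4 and 3/4 give the same answers as long as they stay on the same side of 1/2.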

RP 18 Entry as a Signaling Game

(a)  The  extensive  form  of  this  signaling  game  is: 


(b, c) The strategy combination (pu, af) (strong type p, incumbent a after p) is a
Nash equilibrium. It is a separating perfect Bayesian equilibrium for α = 1
and β = 0. Also (uu, ff) is a Nash equilibrium. It is pooling perfect Bayesian
for β = 1/2 and α ≤ 1/2. It does not satisfy the intuitive criterion since that
requires α = 1.

RP  19  Bargaining  (1) 

(a) Player 1 has only one type. Player 2 has infinitely many types, namely each
v ∈ [0, 1] is a possible type of player 2. A typical strategy of player 1 consists
of a price p1 ∈ [0, 1] and a yes/no decision depending on the price p2 of player
2 if that player rejects p1; in principle, the yes/no decision may also depend on
p1.

(b) A typical strategy of player 2 consists, for every type v ∈ [0, 1], of a yes/no
decision depending on the price p1 asked by player 1 and a price p2 in case
the decision was ‘no’. In principle, p2 may also depend on p1 (not only via the
yes/no decision).

(c) Player 2 accepts if v − p1 ≥ δv (noting that he can offer p2 = 0 if he does not
accept the price p1 of player 1); rejects and offers p2 = 0 if v − p1 < δv.

(d) Using (c) player 1 asks the price p1 that maximizes p1 · Pr[p1 ≤ (1 − δ)v], i.e.,
his expected payoff (note that his payoff is 0 if player 2 rejects). Hence, player
1 solves max_{p1 ∈ [0,1]} p1 · [1 − p1/(1 − δ)], which has solution p1 = (1 − δ)/2. So
the equilibrium is that player 1 asks this price and accepts any price of player
2; and player 2 accepts any price at most (1 − δ)/2, and rejects any higher price
and then offers p2 = 0.
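
The maximization in (d) can be checked numerically. The sketch below is a minimal Python illustration, assuming (as the acceptance probability 1 − p1/(1 − δ) suggests) that the type v is uniformly distributed on [0, 1]. The grid search and the function names are illustrative choices, not part of the text.

```python
from fractions import Fraction


def expected_payoff(p, delta):
    """p times the probability that a uniform type v satisfies p <= (1 - delta) * v."""
    accept_prob = max(Fraction(0), 1 - p / (1 - delta))
    return p * accept_prob


def best_price(delta, steps=1000):
    """Grid search for the price maximizing player 1's expected payoff."""
    grid = [Fraction(k, steps) for k in range(steps + 1)]
    return max(grid, key=lambda p: expected_payoff(p, delta))


delta = Fraction(1, 2)
print(best_price(delta), (1 - delta) / 2)
```

For δ = 1/2 the grid maximizer coincides with (1 − δ)/2 = 1/4. Exact rationals are used so that the comparison does not depend on floating-point rounding.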

RP 20 Bargaining (2)

(a) The (Pareto) boundary of the feasible set consists of all pairs (x, 1 − x²) for
x ∈ [0, 1].

(b) The Nash bargaining solution outcome is found by maximizing the expression
x(1 − x²) over all x ∈ [0, 1]. The solution is ((1/3)√3, 2/3). In distribution
of the good: ((1/3)√3, 1 − (1/3)√3).

(c, d) Let (x, 1 − x²) be the proposal of player 1 and (y, 1 − y²) that of player 2.
Then the equations 1 − x² = δ(1 − y²) and y = δx hold for the Rubinstein
outcome. This results in x = 1/√(1 + δ + δ²); taking the limit for δ → 1 gives
(1/3)√3, which is indeed the Nash bargaining solution outcome for player 1.
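
The closed form for the Rubinstein outcome and its limit can be checked numerically. This is a minimal Python sketch using only the two equations stated in (c, d); the function names are hypothetical.

```python
import math


def rubinstein_x(delta):
    """Player 1's share x solving 1 - x^2 = delta * (1 - y^2) with y = delta * x."""
    return 1 / math.sqrt(1 + delta + delta ** 2)


def residual(delta):
    """Left-hand side minus right-hand side of the first equation at the closed form."""
    x = rubinstein_x(delta)
    y = delta * x
    return (1 - x ** 2) - delta * (1 - y ** 2)


nash_x = math.sqrt(3) / 3  # maximizer of x * (1 - x^2), where 1 - 3 x^2 = 0

for delta in (0.5, 0.9, 0.99, 0.999):
    print(delta, rubinstein_x(delta), residual(delta))
```

The residual stays at rounding-error level for each δ, and the printed shares approach (1/3)√3 ≈ 0.577 as δ approaches 1.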

RP  21  Bargaining  (3) 

(a) Player 1 proposes (1 − δ + (1/2)δ², δ − (1/2)δ²) at t = 0 and player 2 accepts.
Note that 1 − δ + (1/2)δ² > δ − (1/2)δ², so the beginning player has an
advantage.

(b) If the utility function of player 2 were the same as that of player 1, then the
Nash bargaining solution would result in equal split. This is still the case if
player 2’s utility function is multiplied by 2, as is the case here: the maximum
of u(x) · 2u(1 − x) is attained at the same point as the maximum of u(x) · u(1 − x).
So the division of the good is (1/2, 1/2). In terms of utilities, this gives
(u(1/2), 2u(1/2)). (The Nash bargaining solution is symmetric, Pareto optimal,
and scale covariant: see Chap. 10.)

RP  22  Ultimatum  Bargaining 

(a) Player 1 chooses an action/strategy (1 − m, m). Player 2 decides for each strategy
of player 1 whether to accept or reject the offer. If he accepts, the payoffs are
(1 − m, m + a(2m − 1)), otherwise the payoffs are (0, 0).

(b) Player 1 proposes (1 − a/(1 + 2a), a/(1 + 2a)), and player 2 accepts (1 − m, m) if
and only if m ≥ a/(1 + 2a). Hence, the outcome is (1 − a/(1 + 2a), a/(1 + 2a)).

(c)  If  a  becomes  large,  then  this  outcome  converges  to  equal  split:  this  is  because 
then  player  2  cares  mainly  about  the  division  and  not  so  much  about  what  he 
gets. 

RP  23  An  Auction  (1) 

(a)  The  game  has  imperfect  but  complete  information. 

(b) The unique Nash equilibrium is each bidder bidding v1 = v2.

(c) There is no Nash equilibrium.

(d) The associated bimatrix game is:

            0            1          2              3
    0     1/2, 3/2     0, 2       0, 1           0, 0
    1     0, 0         0, 1       0, 1           0, 0
    2     −1, 0        −1, 0      −1/2, 1/2      0, 0
    3     −2, 0        −2, 0      −2, 0          −1, 0

The  Nash  equilibria  are  (0, 1),  (1, 1),  and  (1,2). 

RP  24  An  Auction  (2) 

(a) Let b_i < v_i. If b_i wins then v_i is equally good. If b_i loses and the winning bid
is below v_i then v_i is a strict improvement. If b_i loses and the winning bid is at
least v_i then v_i is at least as good. If, on the other hand, b_i > v_i then, if b_i wins,
the fourth-highest bid is below v_i and the second highest bid is above v_i, then
bidding v_i results in zero instead of positive payoff.

(b) For instance, player 2 can improve by any bid above v1.

(c) All bidders bid v where v ∈ [v2, v1].

RP  25  An  Auction  (3) 

(a) The best reply function b2 of player 2 is given by: b2(0) = {1}, b2(1) = {2},
b2(2) = {3}, b2(3) = b2(4) = {0, ..., 4}, b2(5) = {0, ..., 5}, b2(6) =
{0, ..., 6}. The best reply function b1 of player 1 is given by: b1(0) = {0},
b1(1) = {1}, b1(2) = {2}, b1(3) = {3}, b1(4) = {4}, b1(5) = {5},
b1(6) = {0, ..., 6}.

(b)  The  Nash  equilibria  are:  (3,3),  (4,4),  (5,5),  and  (6,  6). 
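
The equilibria in (b) are the mutual best replies of the two correspondences in (a). A minimal Python sketch follows; the dictionaries encode one reading of the (partly garbled) best-reply sets and should be treated as an assumption.

```python
# b2[x]: player 2's best replies to bid x of player 1 (assumed reading of part (a))
b2 = {0: {1}, 1: {2}, 2: {3},
      3: set(range(5)), 4: set(range(5)),
      5: set(range(6)), 6: set(range(7))}

# b1[y]: player 1's best replies to bid y of player 2 (assumed reading of part (a))
b1 = {0: {0}, 1: {1}, 2: {2}, 3: {3}, 4: {4}, 5: {5}, 6: set(range(7))}

# A pair (x, y) is a Nash equilibrium when each bid is a best reply to the other.
equilibria = [(x, y) for x in range(7) for y in range(7)
              if x in b1[y] and y in b2[x]]
print(equilibria)
```

Under this reading, the intersection consists of (3, 3), (4, 4), (5, 5), and (6, 6).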

RP  26  Quantity  Versus  Price  Competition 

(a) The profit functions are q1(4 − 2q1 − q2) and q2(4 − q1 − 2q2) respectively
(or zero in case an expression is negative). The first-order conditions (best
reply functions) are q1 = (4 − q2)/4 and q2 = (4 − q1)/4 (or zero) and the
equilibrium is q1 = q2 = 4/5 with associated prices equal to 8/5 and profits
equal to 32/25.

(b) Follows by substitution.

(c, d) The profit functions are (1/3)p1(p2 − 2p1 + 4) and (1/3)p2(p1 − 2p2 + 4)
(or zero) respectively. The first-order conditions (best reply functions) are
p1 = (p2 + 4)/4 and p2 = (p1 + 4)/4. The equilibrium is p1 = p2 = 4/3
with associated quantities q1 = q2 = 8/9 and profits equal to 32/27. Price
competition is tougher.
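
The two equilibria can be recomputed with exact arithmetic. The sketch below is a minimal Python illustration that uses only the best-reply functions and profit expressions stated in (a) and (c, d); the helper symmetric_fixed_point is hypothetical.

```python
from fractions import Fraction


def symmetric_fixed_point(intercept, slope):
    """Solve x = intercept + slope * x, the symmetric point of a linear best reply."""
    return intercept / (1 - slope)


# Quantity competition: q_i = (4 - q_j) / 4 = 1 - q_j / 4
q = symmetric_fixed_point(Fraction(1), Fraction(-1, 4))
price_q = 4 - 2 * q - q            # price of firm 1 when q1 = q2 = q
profit_q = q * price_q

# Price competition: p_i = (p_j + 4) / 4 = 1 + p_j / 4
p = symmetric_fixed_point(Fraction(1), Fraction(1, 4))
quantity_p = Fraction(1, 3) * (p - 2 * p + 4)
profit_p = p * quantity_p

print(q, price_q, profit_q)
print(p, quantity_p, profit_p)
```

The profits are 32/25 under quantity competition and 32/27 under price competition, which is the sense in which price competition is tougher.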

RP 27 An Oligopoly Game (1)

(a, b) Player 1 chooses q1 ≥ 0. Players 2 and 3 then choose q2 and q3 simultaneously,
depending on q1. The best reply functions of players 2 and 3 in the
subgame following q1 are q2 = (a − q1 − q3 − c)/2 and q3 = (a − q1 − q2 − c)/2 (or
zero), and the equilibrium in the subgame is q2 = q3 = (a − q1 − c)/3. Player
1 takes this into account and maximizes q1(a − c − q1 − 2(a − q1 − c)/3), which
gives q1 = (a − c)/2. Hence, the subgame perfect equilibrium is: player 1
plays q1 = (a − c)/2; players 2 and 3 play q2 = q3 = (a − q1 − c)/3. The
outcome is player 1 playing (a − c)/2 and players 2 and 3 playing (a − c)/6.
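
A numerical instance illustrates the backward induction in RP 27. This is a minimal Python sketch, assuming inverse demand a − (q1 + q2 + q3) and unit cost c (which is what the stated best replies imply), with the hypothetical values a = 10 and c = 4.

```python
from fractions import Fraction

a, c = Fraction(10), Fraction(4)   # hypothetical demand intercept and unit cost


def follower_quantity(q1):
    """Equilibrium quantity of each follower in the subgame after q1."""
    return max(Fraction(0), (a - q1 - c) / 3)


def leader_profit(q1):
    """Player 1's profit when both followers respond with the subgame equilibrium."""
    total = q1 + 2 * follower_quantity(q1)
    return q1 * (a - total - c)


# Grid search over leader quantities 0, 0.01, ..., 6 (= a - c)
grid = [Fraction(k, 100) for k in range(601)]
q1_star = max(grid, key=leader_profit)
print(q1_star, follower_quantity(q1_star))
```

For these values the leader chooses 3 = (a − c)/2 and each follower produces 1 = (a − c)/6.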

RP  28  An  Oligopoly  Game  (2) 

(a) The best-reply functions are q1 = (10 − q2 − q3)/2, q2 = (10 − q1 − q3)/2,
q3 = (9 − q1 − q2)/2.

(b) The equilibrium is q1 = q2 = 11/4, q3 = 7/4.

(c) To maximize joint profit, q3 = 0 and q1 + q2 = 5. (This follows by using
intuition: firm 3 has higher cost, or by solving the problem as a maximization
problem under nonnegativity constraints.)
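
The equilibrium in (b) follows from the three best-reply conditions in (a). The sketch below is a minimal Python illustration: each condition can be rewritten as q_i + S = a_i, where S is total output and a_i is 10, 10, 9, so summing gives 4S = 29. The names are hypothetical.

```python
from fractions import Fraction

# a_i in the best replies q_i = (a_i - sum of the other quantities) / 2
intercepts = [Fraction(10), Fraction(10), Fraction(9)]

# 2 q_i + (others) = a_i  is the same as  q_i + S = a_i; summing over i gives (n + 1) S.
total = sum(intercepts) / (len(intercepts) + 1)
quantities = [a_i - total for a_i in intercepts]


def best_reply(i, q):
    """Best reply of firm i to the quantities of the other firms in q."""
    others = sum(q) - q[i]
    return max(Fraction(0), (intercepts[i] - others) / 2)


print(quantities)
print([best_reply(i, quantities) for i in range(3)])
```

Both printed lists agree, confirming that (11/4, 11/4, 7/4) is a fixed point of the best-reply functions.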

RP  29  A  Duopoly  Game  with  Price  Competition 

(a) The monopoly price of firm 1 is p1 = 65 and the monopoly price of firm 2 is
p2 = 75.

(b) [Piecewise best-reply function; only the following cases are recoverable, with p_i marking an index that cannot be read:]

    {x | x > p_i}    if p2 … 30
    {x | x > p_i}    if p_i … 51
    {p_i − 1}        if p_i ∈ [52, 75]
    {65}             if p2 ≥ 66
    {75}             if p_i ≥ 76

(c) (p1, p2) = (31, 32).

(d) (p1, p2) = (50, 51).


RP  30  Contributing  to  a  Public  Good 


(a)  The  Nash  equilibria  in  pure  strategies  are  all  strategy  combinations  where 
exactly  two  persons  contribute. 

(b) The expected payoff of contributing is equal to −3 + 8(1 − (1 − p)²), which in
turn is equal to 16p − 8p² − 3.

(c) A player should be indifferent between contributing or not if the other two
players each contribute with probability p, hence 16p − 8p² − 3 = 8p². This holds
for p = 1/4 and for p = 3/4.
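
The indifference condition in (c) is the quadratic 16p² − 16p + 3 = 0. A minimal Python sketch that checks the algebra and locates the roots on a grid of exact rationals is given below; the function names are hypothetical and the two payoff expressions are copied from (b) and (c).

```python
from fractions import Fraction


def contribute(p):
    """Expected payoff of contributing, as stated in (b)."""
    return -3 + 8 * (1 - (1 - p) ** 2)


def not_contribute(p):
    """Expected payoff of not contributing, as used in (c)."""
    return 8 * p ** 2


candidates = [Fraction(k, 100) for k in range(101)]
roots = [p for p in candidates if contribute(p) == not_contribute(p)]
print(roots)
```

The grid contains both roots exactly, so the comparison needs no tolerance.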




RP  31 A  Demand  Game 

(a) Not possible: each player can gain by raising his demand by 0.1.

(b) Not possible: at least one player has x_i > 0.2 and can gain by decreasing his
demand by 0.2.

(c) The unique Nash equilibrium is (0.5, 0.5, 0.5).

(d) A Nash equilibrium is for instance (0.6, 0.6, 0.6).

(e) All triples with sum equal to one, and all triples such that the sum of each pair
is at least one.

RP  32  A  Repeated  Game  (1) 

(a) The unique Nash equilibrium in the stage game is ((2/3, 1/3), (1/2, 1/2)),
with payoffs (8, 22). Therefore, all payoff pairs in the quadrangle with vertices
(16, 24), (0, 25), (0, 18), and (16, 16) which are strictly larger than (8, 22), as
well as (8, 22), can be reached as long run average payoffs in a subgame perfect
equilibrium in the repeated game, for suitable choices of δ.

(b) Write G = (A, B), then v(A) = 8 and −v(−B) = 18. Therefore, all payoff
pairs in the quadrangle with vertices (16, 24), (0, 25), (0, 18), and (16, 16)
which are strictly larger than (8, 18), can be reached as long run average payoffs
in a Nash equilibrium in the repeated game, for suitable choices of δ.

(c) The players alternate between (T, L) and (B, R). Player 1 has no incentive
to deviate, but uses the eternal punishment strategy B to keep player 2 from
deviating. Player 2 will not deviate provided

    25 + 18δ/(1 − δ) ≤ 24/(1 − δ²) + 16δ/(1 − δ²)

and

    18 + 18δ/(1 − δ) ≤ 16/(1 − δ²) + 24δ/(1 − δ²).

The first inequality is satisfied if δ is at least (approximately) 0.55, and the
second inequality if δ ≥ 1/3. Hence, this is a Nash equilibrium for δ ≥ 0.55. It
is not subgame perfect since player 2 can obtain 22 by playing the stage game
equilibrium strategy.
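
The two thresholds in (c) can be recovered from the inequalities. Multiplying the first by 1 − δ² gives 7δ² − 2δ − 1 ≥ 0, whose positive root is (1 + 2√2)/7 ≈ 0.547; the second reduces to δ ≥ 1/3. The Python sketch below checks this numerically, reading the garbled discount symbol as δ (an assumption); the function names are hypothetical.

```python
import math


def gain_first(delta):
    """Right-hand side minus left-hand side of the first no-deviation inequality."""
    follow = (24 + 16 * delta) / (1 - delta ** 2)
    deviate = 25 + 18 * delta / (1 - delta)
    return follow - deviate


def gain_second(delta):
    """Right-hand side minus left-hand side of the second no-deviation inequality."""
    follow = (16 + 24 * delta) / (1 - delta ** 2)
    deviate = 18 + 18 * delta / (1 - delta)
    return follow - deviate


threshold_first = (1 + 2 * math.sqrt(2)) / 7   # positive root of 7 d^2 - 2 d - 1 = 0
threshold_second = 1 / 3

print(threshold_first, gain_first(threshold_first))
print(threshold_second, gain_second(threshold_second))
```

Both gains vanish at the respective thresholds and are positive above them, so the binding condition is the first one.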

RP  33  A  Repeated  Game  (2) 

(a)  (D,  C),  ( D,R ),  and  (M,R). 

(b) Let ((p1, p2, p3), (q1, q2, q3)) be a Nash equilibrium. First consider the case q3 <
1. Then p1 = 0 and therefore q1 = 0. If p2 > 0 then q2 = 0 and q3 = 1, a
contradiction. Hence, p2 = 0, and then p3 = 1. We obtain the set of Nash
equilibria {((0, 0, 1), (0, q2, q3)) | q2, q3 ≥ 0, q2 + q3 = 1, q3 < 1}.

Next, consider the case q3 = 1. Then 9p1 + p2 + 4p3 ≤ p1 + 2p2 + 4p3, hence
8p1 ≤ p2. We obtain another set of Nash equilibria {((p1, p2, p3), (0, 0, 1)) |
p_i ≥ 0, 8p1 ≤ p2, p1 + p2 + p3 = 1}.

(c) Each player has 3 × 3⁹ = 3¹⁰ pure strategies. In the first stage the players play
(U, L) and in the second stage they play (for instance) according to the table

            L         C         R
    U     D, R      M, R      D, R
    M     D, C      D, R      D, R
    D     D, C      D, R      D, R


(d) Always play (U, L) but after a deviation by player 1, player 2 reverts to C
forever, to which player 1 replies by D, and after a deviation by player 2, player
1 reverts to M forever, to which player 2 replies by R. This is a subgame perfect
equilibrium provided that

    10 + 2δ/(1 − δ) ≤ 8/(1 − δ)  ⇔  δ ≥ 1/4

and

    9 + 2δ/(1 − δ) ≤ 8/(1 − δ)  ⇔  δ ≥ 1/7,

hence if δ ≥ 1/4.


RP  34  A  Repeated  Game  (3) 

(a) (D, L), (U, R), and (D, R).

(b) The second row and next the second column can be deleted by iterated elimination
of strictly dominated strategies. This results in the sets of Nash equilibria
{((0, 0, 1), (q1, 0, q3)) | q1, q3 ≥ 0, q1 + q3 = 1} and {((p1, 0, p3), (0, 0, 1)) |
p1, p3 ≥ 0, p1 + p3 = 1}.

(c) In the first stage the players play (M, C) and in the second stage they play (for
instance) according to the table

            L         C         R
    U     D, R      D, L      D, R
    M     U, R      D, R      D, R
    D     D, R      D, L      D, R


(d) Always play (M, C) but after a deviation by player 1 player 2 reverts to L forever,
to which player 1 replies by D, and after a deviation by player 2 player 1 reverts
to U, to which player 2 replies by R. This is a subgame perfect equilibrium
provided that

    12 + δ/(1 − δ) ≤ 10/(1 − δ),

which holds for δ ≥ 2/11.

RP 35 A Repeated Game (4)

(a)  Player  1  plays  B  and  player  2  plays  L  in  both  stages. 

(b) They play (T, L) in the first stage. If player 1 deviates to B, then player 2
plays R in the second stage, otherwise L. Player 1 plays B in the second stage.

(c) Since (B, L) is the unique Nash equilibrium in the stage game and there are no
payoff pairs better for both players, the only possibility is that player 1 plays
B and player 2 plays L forever. This is a subgame perfect equilibrium for any
value of δ, with long run average payoffs (5, 5).

RP  36  A  Repeated  Game  (5) 

(a)  Only  (T,L). 

(b)  The  payoff  pair  (2,1),  and  all  payoff  pairs  larger  for  both  players  in  the  triangle 
with  vertices  (5,  0),  (0,  6),  and  (1,1). 

(c) At even times play (B, L) and at odd times play (T, R). After a deviation revert to
T (player 1) and L (player 2) forever. This is a subgame perfect Nash equilibrium
provided that

    2 + 2δ/(1 − δ) ≤ 5δ/(1 − δ²)

and

    1 + δ/(1 − δ) ≤ 6δ/(1 − δ²),

which is equivalent to δ ≥ max{2/3, 1/5} = 2/3.

RP  37  An  Evolutionary  Game 

(a) The species consists of 100p% animals of type C and 100(1 − p)% animals of
type D.

(b) ṗ = p(0·p + 2(1 − p) − 2p(1 − p) − 3(1 − p)p − (1 − p)²), which after simplification
yields ṗ = 4p(p − 1)(p − 1/4). Hence the rest points are p = 0, 1/4, 1 and
p = 1/4 is stable.

(c) The unique symmetric Nash equilibrium strategy is (1/4, 3/4). One has to check
that (1/4, 3/4)A(q, 1 − q) > (q, 1 − q)A(q, 1 − q) for all q ≠ 1/4, which follows
readily by computation.
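
Both the simplification in (b) and the stability check in (c) can be verified with exact arithmetic. The Python sketch below assumes the payoff matrix A = [[0, 2], [3, 1]] for types C and D, which is inferred from the terms of the replicator expression and is not stated explicitly in this solution. The function names are hypothetical.

```python
from fractions import Fraction

A = [[0, 2], [3, 1]]   # assumed payoffs: rows and columns ordered C, D


def payoff(x, y):
    """Expected payoff x A y, where x and y are the probabilities of playing C."""
    xv, yv = (x, 1 - x), (y, 1 - y)
    return sum(xv[i] * A[i][j] * yv[j] for i in range(2) for j in range(2))


def replicator(p):
    """Right-hand side of the replicator dynamics: p * (payoff of C - average payoff)."""
    return p * (payoff(1, p) - payoff(p, p))


grid = [Fraction(k, 20) for k in range(21)]
x_star = Fraction(1, 4)

matches_closed_form = all(
    replicator(p) == 4 * p * (p - 1) * (p - Fraction(1, 4)) for p in grid
)
is_stable = all(payoff(x_star, q) > payoff(q, q) for q in grid if q != x_star)
print(matches_closed_form, is_stable)
```

The inequality in (c) holds because the difference of the two sides equals 4(q − 1/4)², which is positive for every q ≠ 1/4.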

RP  38  An  Apex  Game 

(a) Suppose (x1, ..., x5) is in the core. Since x1 + x2 ≥ 1, and all x_i are nonnegative
and sum to one, we must have x3 = x4 = x5 = 0. Similarly, x2 = 0, but this
contradicts x2 + ... + x5 ≥ 1. So the core is empty.

(b) Φ2(v) = 1!3!/5! + 3!1!/5! = 1/10, hence Φ(v) = (6/10, 1/10, 1/10, 1/10,
1/10).

(c) Let (1 − 4a, a, a, a, a) be the nucleolus of this game. The relevant (maximal)
excesses to consider are 1 − (1 − 4a) − a = 3a (e.g., {1, 2}) and 1 − 4a ({2, ..., 5}).
Equating these yields a = 1/7.
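
The Shapley value in (b) and the excess equality in (c) can be confirmed by enumeration. The Python sketch below assumes the apex game implied by the solution: a coalition wins if it contains player 1 and at least one other player, or if it consists of players 2 to 5. The function names are hypothetical.

```python
from fractions import Fraction
from itertools import permutations

PLAYERS = (1, 2, 3, 4, 5)


def v(coalition):
    """Worth of a coalition in the assumed apex game."""
    s = set(coalition)
    if 1 in s and len(s) >= 2:
        return 1
    if s >= {2, 3, 4, 5}:
        return 1
    return 0


def shapley():
    """Average marginal contribution of each player over all orderings."""
    orders = list(permutations(PLAYERS))
    value = {i: Fraction(0) for i in PLAYERS}
    for order in orders:
        seen = []
        for i in order:
            value[i] += v(seen + [i]) - v(seen)
            seen.append(i)
    return {i: value[i] / len(orders) for i in PLAYERS}


a = Fraction(1, 7)
x = {1: 1 - 4 * a, 2: a, 3: a, 4: a, 5: a}   # candidate nucleolus from (c)


def excess(coalition):
    """Excess v(S) - x(S) of a coalition at the candidate nucleolus."""
    return v(coalition) - sum(x[i] for i in coalition)


phi = shapley()
print(phi)
print(excess([1, 2]), excess([2, 3, 4, 5]))
```

The enumeration runs over all 120 orderings, which is small enough that no closed-form shortcut is needed.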

RP  39  A  Three-person  Cooperative  Game  (1) 

(a) For a > 10 the core is empty. For a = 10, a core element is for instance (0, 5, 5).
Hence, a ≤ 10.

(b) The Shapley value is ((25 − 2a)/6, (19 + a)/6, (16 + a)/6). By writing down
the core constraints, it follows that this is in the core for −13 ≤ a ≤ 8.75.

(c) At this vector, the excesses of the three two-player coalitions are equal, namely
to (a − 14)/3. For this to be the nucleolus we need that the excesses of the
one-person coalitions are not larger than this, i.e.,

    (2a − 16)/3 ≤ (a − 14)/3,  (−a − 4)/3 ≤ (a − 14)/3,  (−a − 7)/3 ≤ (a − 14)/3,

and it is straightforward to check that this is true for no value of a: the first
inequality requires a ≤ 2, whereas the second requires a ≥ 5.

RP  40  A  Three-person  Cooperative  Game  (2) 

(a) The core is nonempty for a ≤ 1. In that case, the core is the quadrangle (or line
segment if a = 1) with vertices (1, 2, 2), (a, 2, 3 − a), (1, 1, 3), and (a, 2 − a, 3).

(b) The Shapley value is ((2a + 7)/6, (10 − a)/6, (13 − a)/6), which is in the core
for −2 ≤ a ≤ −1/2.

(c) By equating the excesses of the two-person coalitions we obtain the vector
(2/3, 5/3, 8/3) with excess −1/3. This is the nucleolus if a − 2/3 ≤ −1/3,
hence if a ≤ 1/3.

RP  41  Voting 

(a) The winning coalitions (omitting set braces) are AB, AC, ABC, ABD, ACD,
ABCD, and BCD. Then Φ_A(v) = 1!2!/4! + 1!2!/4! + 2!1!/4! + 2!1!/4! +
2!1!/4! = 5/12. Similarly, one computes the other values to obtain Φ(v) =
(1/12)(5, 3, 3, 1). (In fact, it is sufficient to compute Φ_B(v) and Φ_C(v).)

(b) p_A = 5, p_B = 3, p_C = 3, p_D = 1; β(A) = 5/12, β(B) = 3/12, β(C) = 3/12,
β(D) = 1/12.

(c) The winning coalitions are AB, AC, ABC. The Shapley value is (2/3, 1/6, 1/6).
Further, p_A = 3, p_B = p_C = 1; β(A) = 3/5, β(B) = β(C) = 1/5.
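
The swing counts and both power indices in (a) and (b) can be computed directly from the list of winning coalitions. This is a minimal Python sketch; the function names are hypothetical, and the Shapley value is obtained from the swing formula (|S| − 1)!(n − |S|)!/n! used in (a).

```python
from fractions import Fraction
from math import factorial

PLAYERS = "ABCD"
WINNING = {frozenset(s) for s in ("AB", "AC", "ABC", "ABD", "ACD", "ABCD", "BCD")}


def swings(player):
    """Winning coalitions that become losing when the player leaves."""
    return [s for s in WINNING if player in s and s - {player} not in WINNING]


def shapley(player):
    """Sum of (|S| - 1)! (n - |S|)! / n! over the player's swings."""
    n = len(PLAYERS)
    return sum(
        Fraction(factorial(len(s) - 1) * factorial(n - len(s)), factorial(n))
        for s in swings(player)
    )


counts = {p: len(swings(p)) for p in PLAYERS}
banzhaf = {p: Fraction(counts[p], sum(counts.values())) for p in PLAYERS}
print(counts)
print({p: shapley(p) for p in PLAYERS})
print(banzhaf)
```

In this game the normalized Banzhaf index happens to coincide with the Shapley value, since every swing has the same weight 1/12.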

RP  42  An  Airport  Game 

(a) v(1) = v(2) = v(3) = 0, v(12) = v(13) = c1, v(23) = c2, and v(N) =
c1 + c2.

(b) The core is the quadrangle with vertices (c1, c2, 0), (0, c2, c1), (0, c1, c2), and
(c1, 0, c2).

(c) Φ(v) = (1/6)(4c1, 3c2 + c1, 3c2 + c1). This is a core element (check the
constraints).

(d) The nucleolus is of the form (a, (c1 + c2 − a)/2, (c1 + c2 − a)/2). By equating
the excesses of the two-person coalitions it follows that a = (3c1 − c2)/3, hence
the nucleolus would be ((3c1 − c2)/3, 2c2/3, 2c2/3) and the excess of the
two-person coalitions is then −c2/3. We need that the excesses of the one-person
coalitions are not larger, that is, −(3c1 − c2)/3 ≤ −c2/3 and −(2/3)c2 ≤ −c2/3.
This results in the condition c1 ≥ 2c2/3.

RP  43  A  Glove  Game 

(a) By straightforward computation, Φ(v) = (1/60)(39, 39, 14, 14, 14): note that
it is sufficient to compute one of these values.

(b) C(v) = {(1, 1, 0, 0, 0)}.

(c) By (b) and the fact that the nucleolus is in the core whenever the core is
nonempty, the nucleolus is (1, 1, 0, 0, 0).

RP  44  A  Four-Person  Cooperative  Game 

(a) C(v) = {x ∈ ℝ⁴ | x_i ≥ 0 ∀i, x1 + x2 = x3 + x4 = 2, x1 + x3 ≥ 3}. In the
intended diagram, the core is a triangle with vertices (2, 1), (2, 2), and (1, 2).

(b) Φ(v) = (1/4)(5, 3, 5, 3) (it is sufficient to compute one of these values).

RP  45  A  Matching  Problem 

(a) The resulting matching is (x1, y4), (x2, y3), (x3, y2), (x4, y1).

(b) The resulting matching is (x1, y4), (x2, y3), (x3, y1), (x4, y2).

(c) x1 prefers y4 over y1 and y4 prefers x1 over x4.

(d) In any core matching, x2 and y3 have to be paired, since they are each other’s
top choices. Given this, x1 and y4 have to be paired. This leaves only the two
matchings in (a) and (b).

RP  46  House  Exchange 

(a) There are two core allocations: 1: h1, 2: h3, 3: h2 and 1: h2, 2: h3, 3: h1.

(b) The unique top trading cycle is 1, 2, 3, with allocation 1: h2, 2: h3, 3: h1.

(c) Take preference h1, h2, h3 with unique core allocation 1: h1, 2: h3, 3: h2.

RP  47  A  Marriage  Market 

(a)  m\  must  be  paired  to  his  favorite  woman  in  the  core.  Next,  m2  must  be  paired  to 
his  favorite  of  the  remaining  women,  etc. 

(b) (m1, w1), (m2, w2), (m3, w3), (m4, w4).

(c) (m1, w4), (m2, w3), (m3, w2), (m4, w1).

(d) (m1, w2), (m2, w1), (m3, w3), (m4, w4) (one can reason about this but also just try
the six possibilities).